Abstract

We describe a method that we believe may be foundational for a comprehensive theory of
generalised Turán problems. The cornerstone of our approach is a quasirandom counting lemma
for quasirandom hypergraphs, which extends the standard counting lemma by not only counting
copies of a particular configuration but also showing that these copies are evenly distributed. We
demonstrate the power of the method by proving a conjecture of Mubayi on the codegree threshold
of the Fano plane, that any 3-graph on n vertices for which every pair of vertices is contained in
more than n/2 edges must contain a Fano plane, for n sufficiently large. For projective planes over
fields of odd size q we show that the codegree threshold is between $n/2 - q + 1$ and $n/2$, but for
$PG_2(4)$ we find the somewhat surprising phenomenon that the threshold is less than $(1/2 - \epsilon)n$
for some small $\epsilon > 0$. We conclude by setting out a program for future developments of this
method to tackle other problems.



1 Introduction 

A famous unsolved question of Turán asks for the maximum size of a 3-graph on n vertices that does
not contain a tetrahedron $K_4^3$, i.e. 4 vertices on which every triple is present. Despite the superficial
similarity to the analogous easy question for graphs (the maximum size of a graph with no triangle)
this problem has evaded even an asymptotic solution for over 60 years. It may be considered a test
case for the general Turán problem, that of determining the maximum size of a k-graph on n vertices
that does not contain some fixed k-graph F. This maximum size is called the Turán number of F,
denoted ex(n, F). It is not hard to show that the limit $\pi(F) = \lim_{n \to \infty} \mathrm{ex}(n, F)/\binom{n}{k}$ exists. As a
first step to understanding the Turán number we may ask to determine this limit, the Turán density,
which describes the asymptotic behaviour of the Turán number. There are very few known results
even in this weaker form, and no general principles have been developed, even conjecturally.

For $0 \le s \le k$ we may define a generalised Turán number $\mathrm{ex}_s(n, F)$ as the largest number m
such that there is a k-graph H on n vertices that does not contain F and has minimum s-degree
$\delta_s(H) \ge m$. Note that we recover ex(n, F) in the case s = 0, and the case s = k is trivial. The
cases s = 0 and s = 1 are essentially equivalent, via a well-known induction argument, so there is
no new theory here for graphs. However, for general hypergraphs we obtain a rich source of new
problems, and it is not apparent how they relate to each other. There has been much recent interest
in the case s = k - 1; such problems were called codegree problems in [20]. (See also [17], [22] for similar
questions involving structures that are spanning rather than fixed.) We may define generalised
Turán densities as $\pi_s(F) = \lim_{n \to \infty} \mathrm{ex}_s(n, F)/\binom{n-s}{k-s}$.³ A simple averaging argument shows that if we
define the normalised minimum s-degrees of H as $\hat\delta_s(H) = \delta_s(H)/\binom{n-s}{k-s}$ then we have a hierarchy
$\hat\delta_0(H) \ge \hat\delta_1(H) \ge \cdots \ge \hat\delta_{k-1}(H)$, so $\pi_{k-1}(F) \le \cdots \le \pi_0(F)$.

Projective geometries $PG_m(q)$ provide examples of configurations F that are surprisingly tractable
for these problems. For the Fano plane (m = q = 2) the exact Turán number for n sufficiently large
was determined independently and simultaneously by Keevash and Sudakov [13] and Füredi and
Simonovits [6]. They also characterised the unique maximising configuration: a balanced complete
bipartite⁴ 3-graph. Earlier de Caen and Füredi [2] had obtained the Turán density $\pi(PG_2(2)) = 3/4$.
On the other hand Mubayi [19] showed that the codegree density of the Fano plane is $\pi_2(PG_2(2)) = 1/2$.
He conjectured that the exact codegree threshold satisfies $\mathrm{ex}_2(n, PG_2(2)) \le n/2$. The following
result establishes this and characterises the case of equality.

Theorem 1.1 If n is sufficiently large and H is a 3-graph on n vertices with minimum 2-degree at 
least n/2 that does not contain a Fano plane then n is even and H is a balanced complete bipartite 
3-graph. 

General projective geometries have been studied in pT] (the Turán problem) and [15] (the codegree
problem). A general bound $\pi_q(PG_m(q)) \le 1 - 1/m$ was obtained in [15], and it was shown that
equality holds whenever m = 2 and q is 2 or odd, and whenever m = 3 and q is 2 or 3. We prove the
following results, which give quite precise information about the codegree threshold for planes over
a field of odd size, and demonstrate a surprisingly different behaviour for $PG_2(4)$.

Theorem 1.2 Suppose q is an odd prime power. Then $\lfloor n/2 \rfloor - q + 1 \le \mathrm{ex}_q(n, PG_2(q)) \le n/2$. In
the case q = 3 and n even we have $\mathrm{ex}_3(n, PG_2(3)) = n/2 - 1$.

Theorem 1.3 There is $\epsilon > 0$ for which $\pi_4(PG_2(4)) < 1/2 - \epsilon$.

The main idea in our arguments is a quasirandom counting lemma that extends the (usual)
counting lemma for quasirandom hypergraphs. We adopt the Gowers framework as being most
compatible with our argument (there are other approaches to this theory, see Rödl et al. (e.g.
[25], [23]) and Tao [28]). We will give precise definitions later, and for now describe our result on
an intuitive level. Hypergraph regularity theory gives a method of decomposing a hypergraph into

3 In the codegree case s = k - 1 this limit was shown to exist in [20]. In general the existence may be deduced from
a very general theory of Razborov [21]. This is perhaps using a sledgehammer to crack a nut, and in fact the method
of [20] can be extended using martingale estimates in place of hypergeometric estimates. We will elaborate slightly on
the martingale aspect in the final section, but a detailed treatment is beyond the scope we have set for this paper.

4 A k-graph H is bipartite if there is a partition $V(H) = A \cup B$ so that there are no edges of H lying entirely within
A or entirely within B. A complete bipartite k-graph contains all edges that intersect both A and B. It is balanced if
a bounded number of pieces, each of which behaves in an approximately random fashion. The
number of pieces depends only on the degree of approximation and is independent of the size of the 
hypergraph. In order for such a decomposition to be useful, the notion of random behaviour should 
be sufficiently powerful for applications, and the general criterion that has been used is that there 
should be a counting lemma, meaning a result that a sufficiently random hypergraph contains many 
copies of any small fixed configuration. Our quasirandom counting lemma will state that not only 
are there many copies, but that they are uniformly distributed within the hypergraph. 

We also make use of the idea of stability, a phenomenon which was originally discovered by
Erdős and Simonovits in the 1960s in the context of graphs with excluded subgraphs, but has only
been systematically explored relatively recently, as researchers have realised the importance and
applications of such results in hypergraph Turán theory, enumeration of discrete structures and
extremal set theory (see [12] as a recent example and for many further references).

The rest of this paper is organised as follows. The next section is expository in nature: it 
introduces the theory needed in later sections for the special case of graphs, where it will be mostly 
familiar to many readers (although our quasirandom counting lemma is new even for graphs). Then 
in section 3 we introduce the Gowers quasirandomness framework for 3-graphs and present a case
of our quasirandom counting lemma that we will need to prove Theorem 1.1. Section 4 contains
the proof of Theorem 1.1 using the quasirandomness theory from section 3 and also the method of
'stability', or approximate structure. In section 5 we present the general theory of quasirandom
hypergraphs and the full form of our quasirandom counting lemma: this is the engine behind our
entire approach. We also give an application to generalised Turán problems for configurations that
have a certain special form. This general theory is applied in section 6 to the study of codegree
problems in projective planes, where we prove the other theorems stated above. The final section
sets out a program for future developments of this method to other generalised Turán problems.
Since our formulation of the Gowers quasirandomness framework uses some non-trivial variations on 
the original framework, we give justifications for these variations in an appendix to the paper. 

Notation. Write $[n] = \{1, \dots, n\}$. If X is a set and k is a number then $\binom{X}{k} = \{Y \subseteq X : |Y| = k\}$,
$\binom{X}{\le k} = \bigcup_{i \le k} \binom{X}{i}$ and $\binom{X}{< k} = \bigcup_{i < k} \binom{X}{i}$. $a \pm b$ denotes an unspecified real number in the interval
$[a - b, a + b]$. It is convenient to regard a finite set X as being equipped with the uniform probability
measure $\mathbb{P}(\{x\}) = 1/|X|$, so that we can express the average of a function f defined on X as
$\mathbb{E}_{x \in X} f(x)$. A k-graph H consists of a vertex set V(H) and an edge set E(H), each edge being some
k-tuple of vertices. We often identify H with E(H), thus |H| is the number of edges in H. Given
$S \subseteq V(H)$ the neighbourhood of S in H is $N_H(S) = \{T \subseteq V(H) \setminus S : S \cup T \in E(H)\}$ and the degree
of S is $|N_H(S)|$. The minimum s-degree $\delta_s(H) = \min_{|S| = s} |N_H(S)|$ is the minimum of $|N_H(S)|$ over
all subsets S of size s. Given $X \subseteq V(H)$ the restriction $H[X]$ is a k-graph with vertex set X and
edge set equal to all those edges of H that are contained in X. Suppose F and H are k-graphs. The
homomorphism density $d_F(H)$ is the probability that a randomly chosen map $\phi : V(F) \to V(H)$ is
a homomorphism, i.e. $\phi(e)$ is an edge of H for every edge e of F (which we also write as $\phi(F) \subseteq H$).
We also use the same notation in a 'partite setting' (this will be explained when it occurs). When
F = e consists of just a single edge we write $d(H) = d_e(H) = k!\,|E(H)|\,|V(H)|^{-k}$, and call this the
density of H. We use the notation $0 < \alpha \ll \beta$ to mean that there is an increasing function f(x) so
that the ensuing argument is valid for $0 < \alpha < f(\beta)$.
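
As an illustration of the homomorphism density just defined, the following minimal sketch computes $d_F(H)$ by brute force over all maps $V(F) \to V(H)$. The function name `hom_density` and the representation of edges as tuples of vertices are our own choices for this example, and the enumeration is only feasible for very small F and H.

```python
from fractions import Fraction
from itertools import product


def hom_density(F_vertices, F_edges, H_vertices, H_edges):
    """Probability that a uniformly random map V(F) -> V(H) is a homomorphism.

    Edges are given as tuples of vertices; an image that collapses two
    vertices of an edge is never an edge of H, since edges of H are k-sets.
    """
    H_edge_set = {frozenset(e) for e in H_edges}
    F_vertices = list(F_vertices)
    good = 0
    total = 0
    for images in product(H_vertices, repeat=len(F_vertices)):
        phi = dict(zip(F_vertices, images))
        total += 1
        if all(frozenset(phi[v] for v in e) in H_edge_set for e in F_edges):
            good += 1
    return Fraction(good, total)


# Single-edge case: d(H) = k! |E(H)| |V(H)|^{-k}.
triangle = [(0, 1), (0, 2), (1, 2)]
print(hom_density([0, 1], [(0, 1)], [0, 1, 2], triangle))  # 2! * 3 / 3^2
```

When F is a single edge the output agrees with the formula $k!\,|E(H)|\,|V(H)|^{-k}$, since exactly the k! orderings of each edge are homomorphic images.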
2 Graphs: regularity and counting, quasirandomness and quasirandom counting.



The purpose of this section is expository: we introduce the theory needed in later sections for the 
special case of graphs, where it is considerably simpler, and partly familiar to many readers. 

In the first subsection we describe Szemerédi's regularity lemma [27], one of the most powerful
tools in modern graph theory. Roughly speaking, it says that any graph can be approximated by an
average with respect to a partition of its vertex set into a bounded number of classes, the number
of classes depending only on the accuracy of the desired approximation, and not on the number of
vertices in the graph. Each pair of classes spans a bipartite subgraph that is 'regular', meaning that
the proportion of edges in any large bipartite subgraph is close to the proportion of edges in the pair
as a whole. A key property of this approximation is that it leads to a 'counting lemma', allowing
an accurate prediction of the number of copies of any small fixed graph spanned by some specified
classes of the partition. We refer the reader to [16] for a survey of the regularity lemma and its
applications.

The second subsection discusses quasirandomness of graphs, a concept introduced by Chung,
Graham and Wilson [4] (see also Thomason [31] for a similar notion). There are many ways of
describing this concept, all of which are broadly equivalent (up to renaming constants); in fact, it is
also equivalent to regularity (as described in the first subsection). A particularly simple formulation
is to call a bipartite graph quasirandom if the number of 4-cycles is close to what would be expected
in a random graph with the same edge density. A closely related formulation that forms the basis for
the Gowers approach to quasirandomness in hypergraphs is to say that if we count 4-cycles weighted
by the 'balanced function' of the graph then the result is small. Our discussion in this subsection
is based on section 3 of [7] (we are more brief on those points discussed there, but we also provide
some additional arguments that are omitted there).

In the third subsection we introduce the graph case of our quasirandom counting lemma, an 
extension of the counting lemma discussed in the first subsection, saying that copies of any small 
fixed graph are well-distributed in the graph. This is a new result even in the special case of graphs, 
and has consequences that are somewhat surprising at first sight. 

2.1 Regularity and counting 

We start by describing the notion of 'regularity' for bipartite graphs. The density of a bipartite
graph G = (A, B) with vertex classes A and B is defined to be

\[ d_G(A, B) = \frac{|E_G(A, B)|}{|A||B|}. \]

We often write d(A, B) if this is unambiguous. Given $\epsilon > 0$, we say that G is $\epsilon$-regular if for all
subsets $X \subseteq A$ and $Y \subseteq B$ with $|X| > \epsilon|A|$ and $|Y| > \epsilon|B|$ we have that $|d(X, Y) - d(A, B)| < \epsilon$.
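
To make the definition concrete, here is a minimal brute-force sketch that tests regularity directly from the definition, taking density to be the number of edges between X and Y divided by |X||Y|. The helper names (`density`, `large_subsets`, `is_eps_regular`) are ours, a bipartite graph is assumed to be given as a set of pairs (x, y), and the search over all subsets is exponential, so this is only meant for toy examples.

```python
from fractions import Fraction
from itertools import combinations


def density(edges, X, Y):
    """d(X, Y) = e(X, Y) / (|X||Y|) for a bipartite graph given as a set of (x, y) pairs."""
    return Fraction(sum((x, y) in edges for x in X for y in Y), len(X) * len(Y))


def large_subsets(S, eps):
    """Yield all subsets of S with more than eps * |S| elements."""
    for size in range(1, len(S) + 1):
        if size > eps * len(S):
            yield from combinations(S, size)


def is_eps_regular(edges, A, B, eps):
    """Check |d(X, Y) - d(A, B)| < eps for all sufficiently large X, Y."""
    d = density(edges, A, B)
    return all(
        abs(density(edges, X, Y) - d) < eps
        for X in large_subsets(A, eps)
        for Y in large_subsets(B, eps)
    )


A = B = [0, 1]
matching = {(0, 0), (1, 1)}
print(is_eps_regular(matching, A, B, 0.6))  # only X = A, Y = B is tested
print(is_eps_regular(matching, A, B, 0.4))  # single vertices are tested too
```

A perfect matching on two plus two vertices passes for 0.6 (no proper subsets are large enough) but fails for 0.4, since a single matched pair has density 1 against an overall density of 1/2.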

The regularity lemma says that any graph can be partitioned into a bounded number of regular 
pairs and a few leftover edges. Formally: 
Theorem 2.1 For every real $\epsilon > 0$ and number $m_0 \ge 1$ there are numbers $m, n_0 \ge 1$ so that for any
graph G on $n \ge n_0$ vertices we can partition its vertices as $V(G) = V_0 \cup V_1 \cup \cdots \cup V_k$ so that

• $m_0 \le k \le m$,

• $|V_0| < \epsilon n$,

• $|V_1| = |V_2| = \cdots = |V_k|$, and

• $(V_i, V_j)$ spans an $\epsilon$-regular bipartite subgraph of G for all but at most $\epsilon k^2$ pairs $1 \le i < j \le k$.

Remarks. In applications one takes $k \ge m_0 \ge \epsilon^{-1}$ so that the number of edges within any $V_i$ is
negligible. An 'exceptional class' $V_0$ is allowed so that the remaining partition can be 'equitable',
i.e. $|V_1| = |V_2| = \cdots = |V_k|$. If one prefers not to have an exceptional class then its vertices may be
distributed among the other classes to obtain a partition with the same regularity properties for a
slightly larger $\epsilon$ with the class sizes differing by at most one. We refer the reader to Section 1 of [16]
for further discussion of variants of the regularity lemma.

A key property of regularity is that we can accurately count copies of any small fixed graph.
For the purpose of exposition we state two simple cases, and then the general 'counting lemma'.
Throughout we assume a hierarchy $0 < 1/n \ll \epsilon \ll d$, that every density we consider is at least d
and each part in our graphs contains at least n vertices. The following well-known statements can
be proved using arguments similar to that given for Lemma 2.1 in [16].

Triangles. Suppose G is a tripartite graph with parts $V_1, V_2, V_3$ and each pair $(V_i, V_j)$ spans an
$\epsilon$-regular bipartite graph of density $d_{ij}$. Let $\triangle(G)$ be the set of triangles in G. Then we can
estimate the 'triangle density' in G as $d_\triangle(G) = |\triangle(G)|/(|V_1||V_2||V_3|) = d_{12}d_{13}d_{23} \pm 8\epsilon$.⁵

4-cycles. Suppose G = (X, Y) is an $\epsilon$-regular bipartite graph with density d. Let $C_4(G)$ be the
number of labelled 4-cycles in G, i.e. quadruples $(x_1, x_2, y_1, y_2)$ with $x_1 \ne x_2 \in X$, $y_1 \ne y_2 \in Y$
such that $x_1y_1$, $x_1y_2$, $x_2y_1$ and $x_2y_2$ are all edges of G. We may define $d_{C_4}(G)$, the 'bipartite
homomorphism density' of $C_4$ in G, as follows. Fix a 4-cycle $C_4 = (A, B)$, considered as a
bipartite graph $K_{2,2}$ on $A = \{a_1, a_2\}$ and $B = \{b_1, b_2\}$. Let $\Phi$ be the set of all 'bipartite maps'
$\phi$ from $A \cup B$ to $X \cup Y$, i.e. functions with $\phi(A) \subseteq X$ and $\phi(B) \subseteq Y$. Define⁶

\[ d_{C_4}(G) = \mathbb{P}_{\phi \in \Phi}[\phi(C_4) \subseteq G] = C_4(G)/(|X|^2|Y|^2) \pm O(1/n). \]

Then $d_{C_4}(G) = d^4 \pm 10\epsilon$. In fact, a lower bound of $d^4$ follows from the Cauchy-Schwarz
inequality, so $\epsilon$-regularity of G is only needed to prove the upper bound.

General graphs. Suppose G is an r-partite graph on $V = V_1 \cup \cdots \cup V_r$ and each pair $(V_i, V_j)$,
$1 \le i < j \le r$ spans an $\epsilon$-regular bipartite graph with density $d_{ij}(G)$. Suppose H is an r-partite
graph on $Y = Y_1 \cup \cdots \cup Y_r$. Let $\Phi(Y, V)$ be the set of r-partite maps from Y to V, i.e.
maps $\phi : Y \to V$ with $\phi(Y_i) \subseteq V_i$ for $1 \le i \le r$. Define the r-partite homomorphism density
of H in G as $d_H(G) = \mathbb{P}_{\phi \in \Phi(Y, V)}[\phi(H) \subseteq G]$. Then $d_H(G) = \prod_{e \in E(H)} d_e(G) \pm O_H(\epsilon)$, where
$d_e(G)$ means that density $d_{ij}(G)$ for which $e \in H[Y_i, Y_j]$ and the $O_H$-notation indicates that
the implied constant depends only on H.

5 The constant 8 is not best possible, but we only care that the error should tend to zero as $\epsilon$ tends to 0.
6 By $\phi(C_4) \subseteq G$ we mean $\phi(e) \in E(G)$ for each edge e of $C_4$.
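
The Cauchy-Schwarz lower bound mentioned in the 4-cycle case can be checked on small examples. The sketch below computes $d_{C_4}(G)$ as the average over ordered pairs $(y_1, y_2)$ of the squared normalised codegree, which is the form in which Cauchy-Schwarz is applied; the function name `c4_density_via_codegrees` and the edge-set representation are assumptions made for this illustration.

```python
from fractions import Fraction


def c4_density_via_codegrees(edges, X, Y):
    """d_{C4}(G) = E_{y1, y2} (|N(y1) & N(y2)| / |X|)^2, with (x, y) pairs as edges."""
    total = Fraction(0)
    for y1 in Y:
        for y2 in Y:
            codegree = sum((x, y1) in edges and (x, y2) in edges for x in X)
            total += Fraction(codegree, len(X)) ** 2
    return total / len(Y) ** 2


X = Y = [0, 1]
edges = {(0, 0), (0, 1), (1, 0)}           # K_{2,2} minus one edge
d = Fraction(len(edges), len(X) * len(Y))  # edge density 3/4
c4 = c4_density_via_codegrees(edges, X, Y)
print(c4, d ** 4, c4 >= d ** 4)
```

For this graph the homomorphism density of $C_4$ is 7/16, comfortably above $d^4 = 81/256$; equality would require all codegrees to be equal, as in a random-like graph.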

2.2 Quasirandomness 

The regularity property discussed in the previous subsection turns out to be characterised by the
counting lemma; in fact, somewhat surprisingly, it is characterised just by counting 4-cycles. To be
precise, if $0 < 1/n \ll \epsilon \ll \epsilon' \ll d < 1$ and G = (X, Y) is a bipartite graph with $|X|, |Y| \ge n$, density
d and $C_4$-density $d_{C_4}(G) < d^4 + \epsilon$ then G is $\epsilon'$-regular. In order to illuminate some later more general
arguments we will prove this fact here, via a closely related characterisation of Gowers (see section
3 of [7]) that forms the basis for the Gowers approach to quasirandomness in hypergraphs.

Suppose G is a bipartite graph with parts X and Y. We can identify G with its characteristic
function G(x, y), which for $x \in X$ and $y \in Y$ is defined to be 1 if xy is an edge of G, or 0 otherwise.
Define the balanced function $\overline{G}(x, y) = G(x, y) - d$, where $d = d_G(X, Y) = |E_G(X, Y)|/(|X||Y|)$ is the
density of G, i.e. $\overline{G}(x, y)$ is $1 - d$ if xy is an edge of G, or $-d$ otherwise. Note that $\sum_{x \in X, y \in Y} \overline{G}(x, y) = 0$.
For any function f defined on $X \times Y$ we define

\[ C_4(f) = |X|^{-2}|Y|^{-2} \sum_{x_1, x_2 \in X} \sum_{y_1, y_2 \in Y} f(x_1, y_1) f(x_1, y_2) f(x_2, y_1) f(x_2, y_2). \]

We can rephrase this by recalling the setup in the previous subsection, where we had a 4-cycle
$C_4 = (A, B)$, considered as a bipartite graph $K_{2,2}$ on $A = \{a_1, a_2\}$ and $B = \{b_1, b_2\}$, and let $\Phi$
denote the set of all bipartite maps $\phi$ from $A \cup B$ to $X \cup Y$. Then

\[ C_4(f) = \mathbb{E}_{\phi \in \Phi} \prod_{e \in E(C_4)} f(\phi(e)). \]

In particular $C_4(G) = d_{C_4}(G)$, where the first instance of G is to be understood as the characteristic
function G(x, y).

We say that a function $f : X \times Y \to [-1, 1]$ is $\eta$-quasirandom if $C_4(f) < \eta$, and we say that G is
$\eta$-quasirandom if its balanced function $\overline{G}$ is $\eta$-quasirandom.
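
The following minimal sketch evaluates $C_4$ of the balanced function exactly on small bipartite graphs, directly from the averaging formula above. The names `c4_functional`, `balanced` and `is_quasirandom` are hypothetical helpers introduced for this example, and the graph is assumed to be given as a set of pairs (x, y) contained in $X \times Y$.

```python
from fractions import Fraction
from itertools import product


def c4_functional(f, X, Y):
    """C_4(f): average of f(x1,y1) f(x1,y2) f(x2,y1) f(x2,y2) over all x1, x2, y1, y2."""
    total = sum(
        f(x1, y1) * f(x1, y2) * f(x2, y1) * f(x2, y2)
        for x1, x2 in product(X, repeat=2)
        for y1, y2 in product(Y, repeat=2)
    )
    return Fraction(total) / (len(X) ** 2 * len(Y) ** 2)


def balanced(edges, X, Y):
    """Balanced function of G: characteristic function minus the edge density."""
    d = Fraction(len(edges), len(X) * len(Y))
    return lambda x, y: Fraction((x, y) in edges) - d


def is_quasirandom(edges, X, Y, eta):
    """G is eta-quasirandom if C_4 of its balanced function is below eta."""
    return c4_functional(balanced(edges, X, Y), X, Y) < eta


X = Y = [0, 1]
matching = {(0, 0), (1, 1)}
complete = {(x, y) for x in X for y in Y}
print(c4_functional(balanced(matching, X, Y), X, Y))
print(c4_functional(balanced(complete, X, Y), X, Y))
```

The perfect matching is as far from quasirandom as a graph of density 1/2 on these parts can be: its balanced function is $\pm 1/2$ with a product structure, so every term of the average equals 1/16. The complete bipartite graph has balanced function identically zero.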

Theorem 2.2 Suppose G is a bipartite graph of density d with parts X and Y of size at least n.
The following are equivalent, in the sense that each implication (i) $\Rightarrow$ (j) is true for
$0 < 1/n \ll \epsilon_i \ll \epsilon_j \ll d < 1$:

(1) G is $\epsilon_1$-regular.

(2) $d_{C_4}(G) = d^4 \pm \epsilon_2$.

(3) $C_4(\overline{G}) < \epsilon_3$.

Before giving the proof we quote a simple version of the 'second moment method', Lemma 6.5 in
0: if $0 < a, d < 1$, $\sum_{i=1}^n a_i \ge (d - a)n$ and $\sum_{i=1}^n a_i^2 \le (d^2 + a)n$ then $a_i = d \pm a^{1/4}$ for all but at most
$3a^{1/2}n$ values of i. Indeed, $\sum_{i=1}^n (a_i - d)^2 = \sum a_i^2 - 2d \sum a_i + d^2 n \le (d^2 + a - 2d(d - a) + d^2)n \le 3an$.
Also, we often use the Cauchy-Schwarz inequality in the form $0 \le \mathbb{E}(Z - \mathbb{E}Z)^2 = \mathbb{E}Z^2 - (\mathbb{E}Z)^2$, for
a random variable Z.
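
The second moment statement is easy to test numerically. The sketch below, with a hypothetical helper `second_moment_check` and floating-point arithmetic, verifies the hypotheses for a given sequence, counts the indices with $|a_i - d| > a^{1/4}$ and returns the bound $3a^{1/2}n$ for comparison; the sample sequence is an arbitrary choice.

```python
def second_moment_check(values, d, a):
    """Return (hypotheses hold, number of exceptional indices, the bound 3 a^{1/2} n)."""
    n = len(values)
    first_moment_ok = sum(values) >= (d - a) * n
    second_moment_ok = sum(v * v for v in values) <= (d * d + a) * n
    exceptional = sum(abs(v - d) > a ** 0.25 for v in values)
    return first_moment_ok and second_moment_ok, exceptional, 3 * a ** 0.5 * n


# 98 typical values and two outliers at distance 0.4 from d = 0.5.
values = [0.5] * 98 + [0.9, 0.1]
ok, exceptional, bound = second_moment_check(values, d=0.5, a=0.01)
print(ok, exceptional, bound)
```

Here the hypotheses hold, the two outliers are the only exceptional indices, and the bound allows up to about 30 of them, so the lemma is far from tight on this example.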

Proof. (1) $\Rightarrow$ (2): This implication is given by the counting lemma quoted in the previous subsection,
but for completeness we give a proof here. Consider a random map $\phi \in \Phi$, i.e. a random bipartite
map from $A \cup B$ to $X \cup Y$, where as before we consider $C_4$ as a bipartite graph with parts $A = \{a_1, a_2\}$
and $B = \{b_1, b_2\}$. Let $E_1$ be the event that $x_1 = \phi(a_1)$ has $(d \pm \epsilon_1)|Y|$ neighbours in Y. By definition
of $\epsilon_1$-regularity we have $\mathbb{P}(E_1) > 1 - 2\epsilon_1$ (there are at most $\epsilon_1|X|$ vertices with more than $(d + \epsilon_1)|Y|$
neighbours and at most $\epsilon_1|X|$ vertices with fewer than $(d - \epsilon_1)|Y|$ neighbours). Let $E_2$ be the event
that $x_2 = \phi(a_2)$ has $(d \pm \epsilon_1)^2|Y|$ neighbours in $N(x_1)$. Again, $\epsilon_1$-regularity gives $\mathbb{P}(E_2 \mid E_1) > 1 - 2\epsilon_1$.
Now the event $\{\phi(C_4) \subseteq G\}$ occurs if and only if $y_1 = \phi(b_1)$ and $y_2 = \phi(b_2)$ lie in $N(x_1) \cap N(x_2)$, so
we have

\[ d_{C_4}(G) = \mathbb{P}_{\phi \in \Phi}[\phi(C_4) \subseteq G] = \mathbb{P}_{\phi \in \Phi}[\{\phi(C_4) \subseteq G\} \cap E_1 \cap E_2] \pm 4\epsilon_1 = d^4 \pm 10\epsilon_1. \]

This proves the implication with $\epsilon_2 = 10\epsilon_1$.

(2) $\Rightarrow$ (3): Suppose that $d_{C_4}(G) = d^4 \pm \epsilon_2$. Our first step is to show that we can also count any
subgraph of $C_4$, in that $d_{P_2}(G) = d^2 \pm \epsilon_2$ and $d_{P_3}(G) = d^3 \pm 4\epsilon_2^{1/4}$, where $P_i$ is the path with i edges
(we already know that $d_{P_1}(G) = d_G(X, Y) = d$, and it is immediate that for a matching $M_2$ of two
edges we have $d_{M_2}(G) = d^2$). We start with the homomorphism density of $P_2$, say with the central
vertex being mapped to X and the two outer vertices to Y (the same bound will hold vice versa).
We can write

\[ d_{P_2}(G) = \mathbb{E}_{x \in X, y_1 \in Y, y_2 \in Y} G(x, y_1) G(x, y_2) = \mathbb{E}_{x \in X} \big( \mathbb{E}_{y \in Y} G(x, y) \big)^2, \]

which by Cauchy-Schwarz is at least $(\mathbb{E}_{x \in X, y \in Y} G(x, y))^2 = d^2$. On the other hand, we can again
apply Cauchy-Schwarz to get

\[ d_{P_2}(G)^2 = \big( \mathbb{E}_{x, y_1, y_2} G(x, y_1) G(x, y_2) \big)^2 \le \mathbb{E}_{y_1, y_2} \big( \mathbb{E}_x G(x, y_1) G(x, y_2) \big)^2 \]
\[ = \mathbb{E}_{y_1, y_2} \mathbb{E}_{x_1, x_2} G(x_1, y_1) G(x_1, y_2) G(x_2, y_1) G(x_2, y_2) = d_{C_4}(G) < d^4 + \epsilon_2. \]

This gives $d_{P_2}(G) = d^2 \pm \epsilon_2$, and moreover, applying the second moment method quoted before the
proof, for a random $x \in X$, with probability at least $1 - 3\epsilon_2^{1/2}$ we have $\mathbb{E}_{y_1, y_2} G(x, y_1) G(x, y_2) =
d^2 \pm \epsilon_2^{1/4}$, so $|N(x)|/|Y| = \mathbb{E}_y G(x, y) = d \pm \epsilon_2^{1/4}$. This allows us to estimate

\[ d_{P_3}(G) = \mathbb{E}_{x_1, x_2 \in X, y_1 \in Y, y_2 \in Y} G(x_1, y_1) G(x_1, y_2) G(x_2, y_2) = (d^2 \pm \epsilon_2^{1/4})(d \pm \epsilon_2^{1/4}) \pm 6\epsilon_2^{1/2} = d^3 \pm 4\epsilon_2^{1/4}, \]

where the main term gives the contribution when $x_1$ and $y_2$ have typical neighbourhoods and $6\epsilon_2^{1/2}$
bounds the error coming from atypical $x_1$ and $y_2$.

Now we can estimate $C_4(\overline{G})$. Write $\overline{G} = f_0 - f_1$, where $f_0(x, y) = G(x, y)$ and $f_1(x, y) = d$ (a
constant function). Then

\[ C_4(\overline{G}) = \mathbb{E}_{x_1, x_2 \in X, y_1, y_2 \in Y} \overline{G}(x_1, y_1) \overline{G}(x_1, y_2) \overline{G}(x_2, y_1) \overline{G}(x_2, y_2) \]
\[ = \sum_M (-1)^{\sum M} \, \mathbb{E}_{x_1, x_2, y_1, y_2} f_{M_{11}}(x_1, y_1) f_{M_{12}}(x_1, y_2) f_{M_{21}}(x_2, y_1) f_{M_{22}}(x_2, y_2), \]

where $M = (M_{ij})_{i,j \in \{1,2\}}$ ranges over $2 \times 2$ matrices with {0, 1}-entries and $\sum M$ is $\sum_{i,j} M_{ij}$. For each
M we can estimate the summand corresponding to M, using the estimate for the homomorphism
density of the subgraph of $C_4$ corresponding to the 0-entries of M. For example, if $M_{11} = M_{12} =
M_{21} = 0$ and $M_{22} = 1$, then M corresponds to a $P_3$, and the corresponding summand is $-d \cdot d_{P_3}(G) =
-d^4 \pm 4\epsilon_2^{1/4}$. Thus we obtain 8 summands in the range $d^4 \pm 4\epsilon_2^{1/4}$ and 8 summands in the range
$-d^4 \pm 4\epsilon_2^{1/4}$, giving a total of at most $64\epsilon_2^{1/4}$. This proves the implication with $\epsilon_3 = 64\epsilon_2^{1/4}$.

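(3) $\Rightarrow$ (1): Suppose $C_4(\overline{G}) < \epsilon_3$. Consider $X' \subseteq X$ with $|X'| > \epsilon_1|X|$ and $Y' \subseteq Y$ with
$|Y'| > \epsilon_1|Y|$. Write $X'(x)$ for the characteristic function of $X'$, i.e. $X'(x)$ is 1 if $x \in X'$, otherwise it
is 0; similarly, let $Y'(y)$ be the characteristic function of $Y'$. Then

\[ d_G(X', Y') - d = \mathbb{E}_{x \in X', y \in Y'} \overline{G}(x, y) = \frac{|X||Y|}{|X'||Y'|} \mathbb{E}_{x \in X, y \in Y} X'(x) Y'(y) \overline{G}(x, y) \]

and by Cauchy-Schwarz (in the first and third inequalities below) we have

\[ \big( \mathbb{E}_{x \in X, y \in Y} X'(x) Y'(y) \overline{G}(x, y) \big)^4 \le \big( \mathbb{E}_{x \in X} ( \mathbb{E}_{y \in Y} X'(x) Y'(y) \overline{G}(x, y) )^2 \big)^2 \]
\[ \le \big( \mathbb{E}_{x \in X} ( \mathbb{E}_{y \in Y} Y'(y) \overline{G}(x, y) )^2 \big)^2 \]
\[ = \big( \mathbb{E}_{x \in X} \mathbb{E}_{y_1, y_2 \in Y} Y'(y_1) Y'(y_2) \overline{G}(x, y_1) \overline{G}(x, y_2) \big)^2 \]
\[ \le \mathbb{E}_{y_1, y_2 \in Y} \big( \mathbb{E}_{x \in X} Y'(y_1) Y'(y_2) \overline{G}(x, y_1) \overline{G}(x, y_2) \big)^2 \]
\[ \le \mathbb{E}_{y_1, y_2 \in Y} \big( \mathbb{E}_{x \in X} \overline{G}(x, y_1) \overline{G}(x, y_2) \big)^2 \]
\[ = \mathbb{E}_{y_1, y_2 \in Y} \mathbb{E}_{x_1, x_2 \in X} \overline{G}(x_1, y_1) \overline{G}(x_1, y_2) \overline{G}(x_2, y_1) \overline{G}(x_2, y_2) \]
\[ = C_4(\overline{G}) < \epsilon_3. \]

This gives $d_G(X', Y') = d \pm \epsilon_1^{-2} \epsilon_3^{1/4}$, which
proves the implication with $\epsilon_1 = \epsilon_3^{1/12}$. □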
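
The sign expansion used in the step (2) $\Rightarrow$ (3) is an exact identity, which can be confirmed on a small graph. In the sketch below the sixteen matrices M are enumerated explicitly and the signed sum is compared with $C_4$ of the balanced function; `average4`, `c4_of_balanced` and `c4_by_expansion` are hypothetical helper names, and exact rational arithmetic is used so that the comparison is not affected by rounding.

```python
from fractions import Fraction
from itertools import product


def average4(f11, f12, f21, f22, X, Y):
    """E over x1, x2 in X and y1, y2 in Y of f11(x1,y1) f12(x1,y2) f21(x2,y1) f22(x2,y2)."""
    total = sum(
        f11(x1, y1) * f12(x1, y2) * f21(x2, y1) * f22(x2, y2)
        for x1, x2 in product(X, repeat=2)
        for y1, y2 in product(Y, repeat=2)
    )
    return Fraction(total) / (len(X) ** 2 * len(Y) ** 2)


def c4_of_balanced(edges, X, Y):
    """C_4 of the balanced function G - d, computed directly."""
    d = Fraction(len(edges), len(X) * len(Y))
    bal = lambda x, y: Fraction((x, y) in edges) - d
    return average4(bal, bal, bal, bal, X, Y)


def c4_by_expansion(edges, X, Y):
    """Signed sum over 2x2 {0,1}-matrices M, with f_0 = G and f_1 = d."""
    d = Fraction(len(edges), len(X) * len(Y))
    f = (lambda x, y: Fraction((x, y) in edges), lambda x, y: d)
    total = Fraction(0)
    for M in product((0, 1), repeat=4):
        m11, m12, m21, m22 = M
        total += (-1) ** sum(M) * average4(f[m11], f[m12], f[m21], f[m22], X, Y)
    return total


X = Y = [0, 1]
edges = {(0, 0), (0, 1), (1, 0)}
print(c4_of_balanced(edges, X, Y), c4_by_expansion(edges, X, Y))
```

Each of the sixteen summands is, up to sign and a power of d, the homomorphism density of the subgraph of $C_4$ selected by the 0-entries of M, which is why estimates for $P_1$, $P_2$, $M_2$, $P_3$ and $C_4$ suffice in the proof.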



2.3 Quasirandom counting 

Now we will introduce the graph case of our quasirandom counting lemma, an extension of the 
counting lemma discussed in the first subsection, saying that copies of any small fixed graph are 
well-distributed in the graph. As before, for the purpose of exposition we lead up to the general case 
through two illustrative cases. 

Triangles. Suppose G is a tripartite graph with parts $V_1, V_2, V_3$ and each pair $(V_i, V_j)$ spans an
$\epsilon$-regular bipartite graph of density $d_{ij}$. We remarked before that the 'triangle density' in G
can be estimated as $d_\triangle(G) = d_{12}d_{13}d_{23} \pm 8\epsilon$. Moreover, it is easy to see that the triangles
of G are 'well-distributed', in that any sufficiently large subsets $V_i' \subseteq V_i$, $1 \le i \le 3$ induce
a subgraph of G that also has triangle density about $d_{12}d_{13}d_{23}$. Indeed, if $V_i' \subseteq V_i$ with
$|V_i'| > \epsilon^{1/2}|V_i|$, $1 \le i \le 3$, then each $(V_i', V_j')$ induces an $\epsilon^{1/2}$-regular subgraph of G with
density $d_{ij} \pm \epsilon$, so we can apply triangle counting directly to see that $(V_1', V_2', V_3')$ has triangle
density $(d_{12} \pm \epsilon)(d_{13} \pm \epsilon)(d_{23} \pm \epsilon) \pm 8\epsilon^{1/2} = d_{12}d_{13}d_{23} \pm 15\epsilon^{1/2}$ (say).

4-cycles. Suppose G = (X, Y) is an $\epsilon$-regular bipartite graph with density d. We saw before that
we can estimate the $C_4$-density of G as $d_{C_4}(G) = d^4 \pm 10\epsilon$. A similar argument to that just
given for triangles allows us to estimate the $C_4$-density of G restricted to sufficiently large subsets
$X_0 \subseteq X$ and $Y_0 \subseteq Y$. But in fact, we can make a stronger claim: given any sufficiently dense
graphs $H_X$ on X and $H_Y$ on Y we can estimate the density of 4-cycles $\{x_1y_1, x_1y_2, x_2y_1, x_2y_2\}$
of G in which $x_1x_2$ is an edge of $H_X$ and $y_1y_2$ is an edge of $H_Y$.

To do this, first recall our definition of $d_{C_4}(G)$ as $\mathbb{P}_{\phi \in \Phi}[\phi(C_4) \subseteq G]$, where we fix a 4-cycle
$C_4 = (A, B)$, considered as a bipartite graph $K_{2,2}$ on $A = \{a_1, a_2\}$ and $B = \{b_1, b_2\}$ and let $\Phi$ be
the set of all bipartite maps from $A \cup B$ to $X \cup Y$. Consider an auxiliary '$C_4$-homomorphism'
graph $G'$, defined as follows. $G'$ is a bipartite graph with parts $X'$ and $Y'$, where $X'$ consists
of all maps $\phi_1 : A \to X$ and $Y'$ consists of all maps $\phi_2 : B \to Y$. Given $\phi_1 \in X'$ and $\phi_2 \in Y'$
we can construct a bipartite map $\phi = (\phi_1, \phi_2) \in \Phi$ in an obvious manner: $\phi$ restricts to $\phi_1$ on
A and $\phi_2$ on B. We say that $\phi_1\phi_2$ is an edge of $G'$ if $\phi = (\phi_1, \phi_2)$ is a homomorphism from $C_4$
to G, i.e. $\phi(a_i)\phi(b_j) \in E(G)$ for $i, j \in \{1, 2\}$.

We claim that if $0 < \epsilon \ll \epsilon'$ and G is $\epsilon$-regular then $G'$ is $\epsilon'$-regular. To see this we use
Theorem 2.2. By construction $G'$ has density $d_{G'}(X', Y') = d_{C_4}(G) = d^4 \pm 10\epsilon$. Also we can
calculate $d_{C_4}(G') = d_{K_{4,4}}(G)$, where $K_{4,4}$ is the complete bipartite graph with 4 vertices in
each part. To see this, label the parts of the $K_{4,4}$ as $A^1 \cup A^2$ and $B^1 \cup B^2$, where $A^1 = \{a_1^1, a_2^1\}$,
$A^2 = \{a_1^2, a_2^2\}$, $B^1 = \{b_1^1, b_2^1\}$, $B^2 = \{b_1^2, b_2^2\}$. Then we can identify a bipartite map $\psi'$ from
$A \cup B$ to $X' \cup Y'$ with a bipartite map $\psi$ from $(A^1 \cup A^2) \cup (B^1 \cup B^2)$ to $X \cup Y$: define
$\psi(a_j^i) = \psi'(a_i)(a_j)$ and $\psi(b_j^i) = \psi'(b_i)(b_j)$. Now $\psi'$ is a homomorphism from $C_4$ to $G'$ if
and only if $\psi$ is a homomorphism from $K_{4,4}$ to G: we have $\{\psi'(a_i), \psi'(b_k)\} \in G'$ $\forall i, k \in \{1, 2\}$
$\Leftrightarrow$ $\{\psi'(a_i)(a_j), \psi'(b_k)(b_\ell)\} \in G$ $\forall i, j, k, \ell \in \{1, 2\}$ $\Leftrightarrow$ $\{\psi(a_j^i), \psi(b_\ell^k)\} \in G$ $\forall i, j, k, \ell \in \{1, 2\}$.
Therefore $d_{C_4}(G') = d_{K_{4,4}}(G)$. But we can estimate $d_{K_{4,4}}(G) = d^{16} \pm O(\epsilon)$ by the counting
lemma, so $d_{C_4}(G') = d_{G'}(X', Y')^4 \pm O(\epsilon)$. Now Theorem 2.2 tells us that $G'$ is $\epsilon'$-regular.

Finally, suppose that we have graphs $H_X$ on X and $H_Y$ on Y with $|E(H_X)| > \epsilon'|X|^2$ and
$|E(H_Y)| > \epsilon'|Y|^2$. Applying the definition of $\epsilon'$-regularity, we see that if we choose a random
edge $x_1x_2$ of $H_X$ and (independently) a random edge $y_1y_2$ of $H_Y$ then $\{x_1y_1, x_1y_2, x_2y_1, x_2y_2\}$
is a 4-cycle in G with probability $d^4 \pm 2\epsilon'$.

General graphs. Suppose G is an r-partite graph on $V = V_1 \cup \cdots \cup V_r$ and each pair $(V_i, V_j)$,
$1 \le i < j \le r$ spans an $\epsilon$-regular bipartite graph with density $d_{ij}(G)$. Suppose H is an
r-partite graph on $Y = Y_1 \cup \cdots \cup Y_r$. Let $\Phi(Y, V)$ be the set of r-partite maps from Y
to V, i.e. maps $\phi : Y \to V$ with $\phi(Y_i) \subseteq V_i$ for $1 \le i \le r$. Recall that the r-partite
homomorphism density of H in G is $d_H(G) = \mathbb{P}_{\phi \in \Phi(Y, V)}[\phi(H) \subseteq G]$. The counting lemma
says that $d_H(G) = \prod_{e \in E(H)} d_e(G) \pm O_H(\epsilon)$, where $d_e(G)$ means that density $d_{ij}(G)$ for which
$e \in H[Y_i, Y_j]$ and the $O_H$-notation indicates that the implied constant depends only on H.

Moreover, copies of H are well-distributed in G in the following sense. Consider the auxiliary
'H-homomorphism' graph $G'$, defined as follows. $G'$ is an r-partite graph on $V' = V_1' \cup \cdots \cup V_r'$,
where $V_i'$ consists of all maps $\phi_i : Y_i \to V_i$. Given $\phi_i \in V_i'$ and $\phi_j \in V_j'$ for some $1 \le i < j \le r$
we say that $\phi_i\phi_j$ is an edge of $G'$ if $\phi_i(y_i)\phi_j(y_j)$ is an edge of G for every edge $y_iy_j$ of H with
$y_i \in Y_i$ and $y_j \in Y_j$. Then copies of the complete graph $K_r$ in $G'$ correspond to homomorphisms
from H to G.

7 To interpret this notation, observe that e.g. $\psi'(a_1) = \phi_1 \in X'$ is a map from A to X, so $\psi(a_2^1) = \phi_1(a_2) \in X$.
Suppose we have a constant hierarchy $0 < \epsilon \ll \epsilon' \ll \epsilon'' \ll 1$. A similar argument to that used
in the case of 4-cycles shows that for every $1 \le i < j \le r$, $(V_i', V_j')$ induces a subgraph of $G'$
with density $d_{ij}(G)^{e_H(Y_i, Y_j)} \pm O(\epsilon)$ that is $\epsilon'$-regular. Suppose also that we have a $|Y_i|$-graph $J_i$ on
$V_i$ with $|E(J_i)| > \epsilon''|V_i|^{|Y_i|}$ for $1 \le i \le r$. Then by $\epsilon'$-regularity of $G'$ and the counting lemma
for $K_r$, we see that if we choose independent random edges $e_i$ in $J_i$ for $1 \le i \le r$ then $\{e_1, \dots, e_r\}$
spans a copy of H in G with probability $\prod_{e \in E(H)} d_e(G) \pm \epsilon''$.
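
The identity $d_{C_4}(G') = d_{K_{4,4}}(G)$ from the 4-cycle case above, together with the fact that the density of $G'$ equals $d_{C_4}(G)$, can be verified exhaustively on a small example. In this sketch a map $\phi_1 : A \to X$ is encoded as the pair $(\phi_1(a_1), \phi_1(a_2))$, graphs are passed as adjacency predicates, and the function names are our own; the sample edge set is arbitrary.

```python
from fractions import Fraction
from itertools import product


def complete_bipartite_density(adj, X, Y, s):
    """Bipartite homomorphism density of K_{s,s} in the graph with adjacency predicate adj."""
    good = sum(
        all(adj(x, y) for x in xs for y in ys)
        for xs in product(X, repeat=s)
        for ys in product(Y, repeat=s)
    )
    return Fraction(good, len(X) ** s * len(Y) ** s)


def c4_homomorphism_graph(adj, X, Y):
    """Auxiliary graph G': vertices are pairs (images of a1, a2) and (images of b1, b2)."""
    X_prime = list(product(X, repeat=2))
    Y_prime = list(product(Y, repeat=2))

    def adj_prime(p, q):
        # p q is an edge of G' when all four pairs form edges of G.
        return all(adj(x, y) for x in p for y in q)

    return adj_prime, X_prime, Y_prime


X = Y = [0, 1, 2]
edges = {(0, 0), (0, 1), (1, 0), (1, 1), (1, 2), (2, 0), (2, 2)}
adj = lambda x, y: (x, y) in edges
adj_p, Xp, Yp = c4_homomorphism_graph(adj, X, Y)

# Density of G' equals the C4-density of G (C4 = K_{2,2}).
print(complete_bipartite_density(adj_p, Xp, Yp, 1), complete_bipartite_density(adj, X, Y, 2))
# C4-density of G' equals the K_{4,4}-density of G.
print(complete_bipartite_density(adj_p, Xp, Yp, 2), complete_bipartite_density(adj, X, Y, 4))
```

Both equalities hold exactly for every bipartite graph, since they come from the bijection between maps into $G'$ and maps into G described in the text; regularity is only needed afterwards, to estimate the $K_{4,4}$-density.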

3 Quasirandom 3-graphs 

In this section we discuss the Gowers approach to quasirandomness in 3-graphs: our exposition will
be quite condensed, and for more details we refer the reader to that given in [7]. Although we will
later repeat this discussion for general k-graphs, we feel it is helpful to first present the case k = 3,
which is simpler to grasp for a reader new to the subject. We conclude this section by describing
our quasirandom counting lemma for the special case of the distribution of octahedra in 3-graphs,
which is analogous to the distribution of 4-cycles in graphs described above; our solution of Mubayi's
conjecture, Theorem 1.1, will only make use of this case.

3.1 Two cautionary examples 

When considering how to generalise regularity from graphs to 3-graphs, a natural first attempt is
to take a 3-graph H and partition its vertices as $V(H) = V_1 \cup \cdots \cup V_k$, for some $k(\epsilon)$, so that
all but at most $\epsilon k^3$ triples $(V_a, V_b, V_c)$ span a tripartite 3-graph that is $\epsilon$-vertex-regular, meaning
that for any $V_a' \subseteq V_a$, $V_b' \subseteq V_b$, $V_c' \subseteq V_c$ with $|V_a'| > \epsilon|V_a|$, $|V_b'| > \epsilon|V_b|$, $|V_c'| > \epsilon|V_c|$ we have
$\mathbb{E}_{x \in V_a', y \in V_b', z \in V_c'} H(x, y, z) = \mathbb{E}_{x \in V_a, y \in V_b, z \in V_c} H(x, y, z) \pm \epsilon$. This is indeed possible, as shown by Chung
and Graham [3], with a proof closely modelled on that of the graph regularity lemma. This result is
often known as the 'weak hypergraph regularity lemma', as although it does have some applications,
the property of vertex-regularity is not strong enough to prove a counting lemma, as the following
example of Rödl demonstrates quite dramatically.

Example. Take a set $X = X_1 \cup X_2 \cup X_3 \cup X_4$, where $X_i$, $1 \le i \le 4$ are pairwise disjoint sets of
size n. Consider a random orientation of the complete 4-partite graph on X, i.e. for every $x_i \in X_i$,
$x_j \in X_j$, $1 \le i < j \le 4$ we choose the arc $x_i \to x_j$ or the arc $x_j \to x_i$, each choice having probability 1/2,
all choices being independent. Define a 3-graph H on X to consist of all triples $x_ix_jx_k$ that induce
a cyclic triangle in the orientation. Then with high probability each triple $(X_i, X_j, X_k)$ spans an
$\epsilon$-vertex-regular triple when $n \gg \epsilon^{-1}$, and this will remain true even if we partition H into $k(\epsilon)$
parts. However, it is easy to see that H does not contain any copy of the tetrahedron $K_4^3$.
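
The absence of tetrahedra in this example is deterministic, not merely likely, and a small simulation makes this visible. The sketch below builds the random orientation with a seeded generator, forms the 3-graph of cyclic triangles, and searches every choice of one vertex per part for a tetrahedron; the function names, the vertex encoding (part index, label) and the parameters n = 4, seed = 0 are assumptions made for illustration.

```python
import random
from itertools import combinations, product


def cyclic_triangle_3graph(n, seed=0):
    """Randomly orient the complete 4-partite graph and return (parts, cyclic triples)."""
    rng = random.Random(seed)
    parts = [[(i, v) for v in range(n)] for i in range(4)]
    arc = set()  # (u, w) in arc means the pair is oriented from u to w
    for i, j in combinations(range(4), 2):
        for u, w in product(parts[i], parts[j]):
            arc.add((u, w) if rng.random() < 0.5 else (w, u))

    def is_cyclic(u, v, w):
        forward = (u, v) in arc and (v, w) in arc and (w, u) in arc
        backward = (v, u) in arc and (w, v) in arc and (u, w) in arc
        return forward or backward

    edges = set()
    for i, j, k in combinations(range(4), 3):
        for u, v, w in product(parts[i], parts[j], parts[k]):
            if is_cyclic(u, v, w):
                edges.add(frozenset((u, v, w)))
    return parts, edges


def contains_tetrahedron(parts, edges):
    """A tetrahedron must use one vertex from each of the four parts."""
    return any(
        all(frozenset(t) in edges for t in combinations(quad, 3))
        for quad in product(*parts)
    )


parts, edges = cyclic_triangle_3graph(n=4, seed=0)
print(len(edges), contains_tetrahedron(parts, edges))
```

The search always fails because a tournament on four vertices has at most two cyclic triangles, so the four triples of any quadruple cannot all be edges, whatever the orientation.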

In the light of this example, we might informally say that vertex-regularity uses a random model 
of a 3-graph in which triples are randomly chosen as edges with some probability p, but that a 
counting lemma fails because of potential correlations between edges that share a pair of vertices. 
Our second example (adapted from [7]) points the way to a better model.

Example. Take a set $X = X_1 \cup X_2 \cup X_3$, where $X_i$, $1 \le i \le 3$ are pairwise disjoint sets of size $n$. For each pair $1 \le i < j \le 3$ define a random bipartite graph $G_{ij}$, in which each $x_ix_j$, $x_i \in X_i$, $x_j \in X_j$ is an edge with probability $d_{ij}$, all choices being independent. Consider $\Delta = \Delta(G_{12} \cup G_{13} \cup G_{23})$, the triangles spanned by these graphs. Define a random 3-graph $H$ by taking each triangle of $\Delta$ to be an edge with probability $d_{123}$. With high probability $H$ has density $d(H) = |E(H)|/n^3 = d_{12}d_{13}d_{23}d_{123} + o(1)$. Now consider the number of copies of some fixed 3-graph $F$ in $H$, or for a better parallel with our analysis of quasirandom graphs consider the 'tripartite $F$-homomorphism density' $d_F(H) = \mathbb{P}_{\phi \in \Phi}[\phi(F) \subseteq H]$, where $\Phi$ is the set of tripartite maps from $F$ to $H$. We can no longer estimate this by $d(H)^{|E(F)|}$, as might at first be expected from the analysis for graphs. For a simple example, suppose that $F$ consists of two edges sharing a pair of vertices; say $F$ has vertex set $A = A_1 \cup A_2 \cup A_3$ with $A_1 = \{a_1\}$, $A_2 = \{a_2\}$, $A_3 = \{a_3, a_3'\}$ and edges $a_1a_2a_3$, $a_1a_2a_3'$. Then with high probability $d_F(H) = d_{12}d_{13}^2d_{23}^2d_{123}^2 + o(1)$.
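
To make the discrepancy concrete, here is a minimal Python sketch of the heuristic count in this random model: one factor $d_{ij}$ for every distinct pair covered by an edge of $F$, and one factor $d_{123}$ for every edge. The function name `predicted_density` and the encoding of vertices as `(part, label)` tuples are assumptions made for this illustration only.

```python
from itertools import combinations
from math import prod

def predicted_density(edges, pair_density, triple_density):
    """Heuristic homomorphism density of a tripartite 3-graph F in the random model.

    edges: triples of vertices, each vertex written as a (part, label) tuple.
    pair_density: dict mapping frozenset({i, j}) to d_ij.
    triple_density: the value d_123.
    """
    edges = [frozenset(e) for e in edges]
    # Each distinct pair covered by an edge must be present in the graph G_ij.
    pairs = {frozenset(p) for e in edges for p in combinations(e, 2)}
    pair_factor = prod(pair_density[frozenset(v[0] for v in p)] for p in pairs)
    # Each edge of F must then be chosen from the triangles independently.
    return pair_factor * triple_density ** len(edges)

d = {frozenset({1, 2}): 0.5, frozenset({1, 3}): 0.5, frozenset({2, 3}): 0.5}
a1, a2, a3, b3 = (1, "a1"), (2, "a2"), (3, "a3"), (3, "a3'")
single = predicted_density([(a1, a2, a3)], d, 0.5)
cherry = predicted_density([(a1, a2, a3), (a1, a2, b3)], d, 0.5)
```

With all densities equal to 1/2, the two-edge configuration has predicted density twice $d(H)^2$, because the shared pair $a_1a_2$ contributes the factor $d_{12}$ only once.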

3.2 Quasirandom complexes 

The second example above shows that a counting lemma for 3-graphs must take account of densities 
of pairs, as well as densities of triples. Thus we are led to define quasirandomness not for a 3-graph 
in isolation, but for a simplicial complex consisting of a 3-graph together with all subsets of its edges. 
We say that $H$ is a tripartite 3-complex$^{8}$ on $X = X_1 \cup X_2 \cup X_3$ if we have $H = \bigcup_{I \subseteq \{1,2,3\}} H_I$, where $H_\emptyset = \{\emptyset\}$, $H_{\{i\}}$ is a subset of $X_i$ for $1 \le i \le 3$, $H_{\{i,j\}}$ is a bipartite graph with parts $H_{\{i\}}$, $H_{\{j\}}$ for $1 \le i < j \le 3$, and $H_{\{1,2,3\}}$ is a 3-graph contained in the set of triangles spanned by $H_{\{1,2\}}$, $H_{\{1,3\}}$ and $H_{\{2,3\}}$. Of course, the interesting part of this structure is the 3-graph together with its underlying graphs: we usually take $H_{\{i\}} = X_i$ for $1 \le i \le 3$, and we only include $H_\emptyset$ so that we are formally correct when referring to $H$ as a (simplicial) complex. Sometimes we allow $H_{\{i\}}$ to be a strict subset of $X_i$, but then we can reduce to the usual case by redefining the ground set as $X' = X_1' \cup X_2' \cup X_3'$, where $X_i' = H_{\{i\}}$, $1 \le i \le 3$.

As usual we identify each $H_I$ with its characteristic function, for example $H_{123}(x_1,x_2,x_3)$ is 1 if $x_1x_2x_3$ is an edge of $H_{123}$, otherwise 0 (henceforth we write 123 instead of $\{1,2,3\}$, etc. for more compact notation). We let $H^*_{123}$ denote the set of triangles spanned by $H_{12}$, $H_{13}$ and $H_{23}$, and also the characteristic function of this set. Then we define the relative density by

$$d_{123}(H) = |H_{123}|/|H^*_{123}|.$$

In words, it is the proportion of graph triangles that are triples of the complex. Note that we can describe the densities of the bipartite graph $H_{ij}$ with a similar notation: we let $H^*_{ij}$ denote the pairs spanned by $H_i$ and $H_j$, i.e. the complete bipartite graph with parts $X_i$ and $X_j$, and then $H_{ij}$ has density $d_{ij}(H) = |H_{ij}|/|H^*_{ij}|$.
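
As a small illustration of the definition, the following Python sketch computes $d_{123}(H)$ by listing the graph triangles directly. It is a brute-force sketch under our own conventions (pairs and triples stored as tuples ordered by part index, function name `relative_density`), not anything prescribed by the text.

```python
from itertools import product

def relative_density(X1, X2, X3, H12, H13, H23, H123):
    """Return d_123(H) = |H_123| / |H*_123| for a tripartite 3-complex.

    H12, H13, H23 are sets of pairs and H123 is a set of triples, each
    written as a tuple ordered by part index.
    """
    # H*_123: the triangles spanned by the three bipartite graphs.
    triangles = {
        (x1, x2, x3)
        for x1, x2, x3 in product(X1, X2, X3)
        if (x1, x2) in H12 and (x1, x3) in H13 and (x2, x3) in H23
    }
    if not triangles:
        raise ValueError("the graphs span no triangles")
    if not H123 <= triangles:
        raise ValueError("every triple must lie on a triangle of the graphs")
    return len(H123) / len(triangles)

density = relative_density(
    {"a", "b"}, {"c"}, {"d"},
    {("a", "c"), ("b", "c")},
    {("a", "d"), ("b", "d")},
    {("c", "d")},
    {("a", "c", "d")},
)
```

In this toy complex the graphs span two triangles and one of them is a triple, so the relative density is 1/2.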

$^{8}$ We prefer this term to 'chain', which is used in [7].

We have seen that vertex-regularity is not the correct notion for defining quasirandomness in 3-graphs; it turns out that the other two properties described in Theorem 2.2 can be generalised to a useful concept for 3-graphs. These properties were defined using 4-cycles for graphs; for 3-graphs they are defined in terms of the octahedron $O_3$. This is a tripartite 3-graph with vertex set $A = A_1 \cup A_2 \cup A_3$, where $A_i = \{a_i^0, a_i^1\}$, $1 \le i \le 3$ and 8 edges $a_1^{e_1}a_2^{e_2}a_3^{e_3}$, where $e = (e_1, e_2, e_3)$ ranges over vectors in $\{0,1\}^3$. One could define quasirandomness of a complex in terms of the density of octahedra (by analogy with property 2 in Theorem 2.2), but we will follow Gowers and consider 'octahedra with respect to the balanced function' (by analogy with property 3 in Theorem 2.2). The balanced function is defined as

$$\bar{H}_{123}(x_1,x_2,x_3) = H_{123}(x_1,x_2,x_3) - d_{123}(H)H^*_{123}(x_1,x_2,x_3),$$

i.e. for $x_1 \in X_1$, $x_2 \in X_2$, $x_3 \in X_3$, $\bar{H}_{123}(x_1,x_2,x_3)$ equals $1 - d_{123}(H)$ if $x_1x_2x_3$ is an edge of $H_{123}$, equals $-d_{123}(H)$ if $x_1x_2x_3$ is a triangle in $H_{12} \cup H_{13} \cup H_{23}$ but not an edge of $H_{123}$, or equals 0 otherwise. As before we note that $\mathbb{E}_{x_1 \in X_1, x_2 \in X_2, x_3 \in X_3} \bar{H}_{123}(x_1,x_2,x_3) = 0$. Let $\Phi$ be the set of tripartite maps from $A = A_1 \cup A_2 \cup A_3$ to $X = X_1 \cup X_2 \cup X_3$. For any function $f : X_1 \times X_2 \times X_3 \to [-1,1]$ we define

$$O_3(f) = \mathbb{E}_{\phi \in \Phi} \prod_{e \in E(O_3)} f(\phi(e)).$$

We say that $f$ is $\eta$-quasirandom (with respect to $H$) if $O_3(f) < \eta(d_{12}d_{13}d_{23})^4$ (note that we are assuming that $d_1 = d_2 = d_3 = 1$). One can think of this as saying that $O_3(f)$ is small compared to the density of the 'graph octahedron', i.e. the complete tripartite graph $K(2,2,2)$, as this has density about $(d_{12}d_{13}d_{23})^4$ when the graphs $H_{ij}$ are quasirandom (as they will be). Then we say that $H_{123}$ is $\eta$-quasirandom (with respect to $H$) if its balanced function $\bar{H}_{123}$ is $\eta$-quasirandom.
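
The octahedron count of a function can be evaluated by brute force on very small ground sets. The sketch below is an illustration added here, with the hypothetical name `octahedron_density`; it averages the product of $f$ over the 8 edges of the octahedron across all tripartite maps. The parity function is included as a toy example of a function with mean zero whose octahedron count is nevertheless as large as possible.

```python
from itertools import product

def octahedron_density(f, X1, X2, X3):
    """Average over tripartite maps of the product of f over the 8 octahedron edges."""
    total = 0.0
    # A tripartite map from the octahedron picks an ordered pair of images in each part.
    for (x1, y1), (x2, y2), (x3, y3) in product(
        product(X1, repeat=2), product(X2, repeat=2), product(X3, repeat=2)
    ):
        term = 1.0
        for a, b, c in product((x1, y1), (x2, y2), (x3, y3)):
            term *= f(a, b, c)
        total += term
    return total / (len(X1) * len(X2) * len(X3)) ** 2

bits = [0, 1]
constant = octahedron_density(lambda a, b, c: 1.0, bits, bits, bits)
parity = lambda a, b, c: (-1.0) ** (a + b + c)
parity_mean = sum(parity(a, b, c) for a, b, c in product(bits, repeat=3)) / 8
parity_octahedra = octahedron_density(parity, bits, bits, bits)
```

Each vertex of the octahedron lies in four edges, so every sign in the parity example appears an even number of times and each product equals 1. This is one way to see that a small average does not by itself imply a small octahedron count.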

3.3 Quasirandom decomposition and the counting lemma 

Next we will see why the notion of quasirandomness given in the previous subsection is useful: there 
is a decomposition theorem and a counting lemma. 

To describe the decomposition theorem we need to first think about the kind of structure that will 
arise in a decomposition into simplicial complexes. It will not simply be a disjoint union of simplicial 
complexes: just as in a graph decomposition the various bipartite graphs may share vertices, in a 
3-graph decomposition the various tripartite 3-complexes may share vertices and pairs. Suppose H 
is an $r$-partite 3-graph on $X = X_1 \cup \cdots \cup X_r$. A decomposition of $H$ is described by partitions $X_i = X_i^1 \cup \cdots \cup X_i^{n_i}$ of each part $X_i$ into subsets, and partitions $K(X_i, X_j) = G_{ij}^1 \cup \cdots \cup G_{ij}^{n_{ij}}$ of each complete bipartite graph with parts $X_i$, $X_j$ into bipartite subgraphs. Let $P$ denote this 'partition system'. For each edge $e$ of $H$ there is an induced complex $H(e,P)$ defined as follows. The vertex set of $H(e,P)$ is $X_i^{a_i} \cup X_j^{a_j} \cup X_k^{a_k}$, where $e = x_ix_jx_k$ with $x_i \in X_i^{a_i}$, $x_j \in X_j^{a_j}$, $x_k \in X_k^{a_k}$. The pairs in $H(e,P)$ are given by the restriction of $G_{ij}^{a_{ij}} \cup G_{ik}^{a_{ik}} \cup G_{jk}^{a_{jk}}$ to the vertex set of $H(e,P)$, where $x_ix_j \in G_{ij}^{a_{ij}}$, $x_ix_k \in G_{ik}^{a_{ik}}$, $x_jx_k \in G_{jk}^{a_{jk}}$. The triples in $H(e,P)$ are those edges of $H$ that also form triangles in the pairs of $H(e,P)$.

The following decomposition is a variant of Theorem 8.10 in [7]. (We defer a justification of the 
differences here and elsewhere until the appendix.) 

Theorem 3.1 Suppose $0 < 1/n \ll d_1 \ll \eta_2 \ll d_2 \ll \eta_3 \ll 1/r, \epsilon < 1$ and $H$ is an $r$-partite 3-graph on $X = X_1 \cup \cdots \cup X_r$, where each $|X_i| \ge n$. Then there is a partition system $P$ consisting of partitions $X_i = X_i^0 \cup X_i^1 \cup \cdots \cup X_i^{n_i}$ with each $n_i \le 1/d_1$ and partitions $K(X_i,X_j) = G_{ij}^1 \cup \cdots \cup G_{ij}^{n_{ij}}$ with each $n_{ij} \le d_2^{-1/2}$, such that,

• each exceptional class $X_i^0$ has size at most $\epsilon|X_i|$, for $1 \le i \le r$,

• the classes $X_i^{a_i}$, $1 \le i \le r$, $1 \le a_i \le n_i$ all have the same size, and

• if $e$ is a randomly chosen edge of $H$, then with probability at least $1 - \epsilon$, in the induced complex $H(e,P)$, the graphs are $\eta_2$-quasirandom, and the 3-graph is $\eta_3$-quasirandom with respect to $H(e,P)$.

To apply this decomposition we need a counting lemma, estimating the number of homomorphisms from a fixed $r$-partite 3-complex $F$ on $Y = Y_1 \cup \cdots \cup Y_r$ to a quasirandom $r$-partite 3-complex $H$ on $X = X_1 \cup \cdots \cup X_r$. Let $\Phi(Y,X)$ denote the set of $r$-partite maps $\phi : Y \to X$. We define the partite homomorphism density of $F$ in $H$ by$^{9}$

$$d_F(H) = \mathbb{P}_{\phi \in \Phi(Y,X)}[\phi(F) \subseteq H].$$

The following theorem is a variant of Corollary 5.2 in [8]. 

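Theorem 3.2 Suppose $0 < \eta_2 \ll d_2 \ll \eta_3 \ll d_3, \epsilon, 1/|F| < 1$, that $H$ and $F$ are $r$-partite 3-complexes on $X = X_1 \cup \cdots \cup X_r$ and $Y = Y_1 \cup \cdots \cup Y_r$ respectively, every graph $H_{ij}$ is $\eta_2$-quasirandom with density at least $d_2$, and that every 3-graph $H_{ijk}$ is $\eta_3$-quasirandom with relative density at least $d_3$, with respect to $H$. Then$^{10}$

$$d_F(H) = \mathbb{E}_{\phi \in \Phi(Y,X)}\Big[\prod_{e \in F} H_e(\phi(e))\Big] = (1 \pm \epsilon)\prod_{e \in F} d_e(H).$$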

Remarks. (1) The special case when $F$ is a single edge shows that the absolute density of a triple in $H$ is well-approximated by the product of its relative densities, viz.

$$d(H_{ijk}) = |H_{ijk}|/|X_i||X_j||X_k| = (1 \pm \epsilon)d_{ij}d_{ik}d_{jk}d_{ijk}. \qquad (1)$$

(2) It is important to note the hierarchy of the parameters, as this is where some of the technical difficulties in hypergraph regularity lie. In the decomposition the number of graphs $G_{ij}$ in the partition may be much larger than $\eta_3^{-1}$, so the counting lemma has to cope with graphs $H_{ij}$ with density much smaller than $\eta_3$, the quasirandomness parameter for triples; fortunately it can be arranged that $\eta_2$, the quasirandomness parameter for pairs, is smaller still, and this is sufficient for the counting lemma.

3.4 Counting homomorphisms to functions and uniform edge-distribution 

We will also need the following more general theorem, a variant of Theorem 5.1 in [8], which allows estimation of the number of $F$-homomorphisms with respect to functions supported on $H$; the generalisation is similar to that which we saw earlier from counting 4-cycles in quasirandom graphs to estimating $C_4(f)$ for a function $f$. Intuitively it expresses the fact that, in a quasirandom 3-complex, sets of size $k \le 3$ are almost uncorrelated with sets of size less than $k$. First we need some notation. Suppose $X = X_1 \cup \cdots \cup X_r$ is a set partitioned into $r$ parts. For $A \subseteq [r]$ let $K_A(X)$ be the set of all $|A|$-tuples with one point in each $X_i$, $i \in A$. We say that $e \in K_A(X)$ has index $A$ and we sometimes abuse notation and use $e$ instead of $A$; thus, if $H$ is an $r$-partite 3-complex on $X$ and $e \in H_A$ for some $A \subseteq [r]$, $|A| \le 3$ then $H_e = H_A$.

$^{9}$ Note that we use the same notation for the normal homomorphism density: it will be clear from the context which is intended, and in any case they are roughly equivalent for our purposes in that they only differ by a constant factor.

$^{10}$ Recall that $d_e(H)$ means the density $d_I(H)$, where $I$ is the index of $e$.



Theorem 3.3 Suppose $0 < \eta_2 \ll d_2 \ll \eta_3 \ll d_3, \epsilon, 1/|F| < 1$, that $H$ and $F$ are $r$-partite 3-complexes on $X = X_1 \cup \cdots \cup X_r$ and $Y = Y_1 \cup \cdots \cup Y_r$ respectively, every graph $H_{ij}$ is $\eta_2$-quasirandom with density at least $d_2$, and that every 3-graph $H_{ijk}$ is $\eta_3$-quasirandom with relative density at least $d_3$ with respect to $H$. Suppose $F_0$ is a subcomplex of $F$, for each $e \in F$ we have a function $f_e$ on $K_e(X)$ with $f_e = f_eH_e$, and $f_e = H_e$ for all $e \in F \setminus F_0$. Then$^{11}$

$$\mathbb{E}_{\phi \in \Phi(Y,X)}\Big[\prod_{e \in F} f_e(\phi(e))\Big] = \mathbb{E}_{\phi \in \Phi(Y,X)}\Big[\prod_{e \in F_0} f_e(\phi(e))\Big]\prod_{e \in F \setminus F_0} d_e(H) \pm \epsilon\prod_{e \in F} d_e(H).$$

Note that Theorem 3.2 is the case of Theorem 3.3 when $F_0 = \emptyset$. The following corollary shows
that quasirandomness implies vertex-uniformity, the analogue of graph regularity that we mentioned 
earlier as being too weak for a useful hypergraph regularity theory. 



Corollary 3.4 Suppose $0 < \eta_2 \ll d_2 \ll \eta_3 \ll d_3, \epsilon < 1$, that $H$ is a tripartite 3-complex on $X = X_1 \cup X_2 \cup X_3$ with $H_i = X_i$, $1 \le i \le 3$, that $H_{12}$, $H_{13}$ and $H_{23}$ are $\eta_2$-quasirandom with density at least $d_2$, and that $H_{123}$ is $\eta_3$-quasirandom with relative density at least $d_3$ with respect to $H$. Suppose $W_i \subseteq X_i$, $1 \le i \le 3$ and let $H_{123}[W]$ be the restriction of $H_{123}$ to $W = W_1 \cup W_2 \cup W_3$. Then

$$\left| \frac{|H_{123}[W]|}{|H_{123}|} - \frac{|W_1||W_2||W_3|}{|X_1||X_2||X_3|} \right| < \epsilon.$$

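Proof. Set $Y_i = \{i\}$ for $1 \le i \le 3$, $F$ equal to all subsets of $Y = Y_1 \cup Y_2 \cup Y_3$ and $F_0 \subseteq F$ equal to the subsets of size at most 1. Apply Theorem 3.3 with $f_i(x_i) = 1_{x_i \in W_i}$ and $f_e = H_e$ for $e \in F \setminus F_0$. Since $H_i = X_i$, $1 \le i \le 3$ we have $\prod_{e \in F \setminus F_0} d_e(H) = \prod_{e \in F} d_e(H)$. Also $\mathbb{E}_{\phi \in \Phi(Y,X)} \prod_{e \in F_0} f_e(\phi(e)) = \prod_{i=1}^3 |W_i|/|X_i|$. Now $\mathbb{E}_{\phi \in \Phi(Y,X)}\big[\prod_{e \in F} f_e(\phi(e))\big] = |H_{123}[W]|/|X_1||X_2||X_3|$ so Theorem 3.3 gives the result. □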



3.5 Uniform distribution of octahedra in quasirandom 3-complexes 

Now we will establish a further special case of our quasirandom counting lemma, the uniform distribution of octahedra in quasirandom 3-complexes, which is the heart of our solution of Mubayi's conjecture. Recall that the octahedron $O_3$ is a tripartite 3-graph with vertex set $A = A_1 \cup A_2 \cup A_3$, where $A_i = \{a_i^0, a_i^1\}$, $1 \le i \le 3$ and 8 edges $a_1^{e_1}a_2^{e_2}a_3^{e_3}$, where $e = (e_1,e_2,e_3)$ ranges over vectors in $\{0,1\}^3$. We write $O_3^{\le}$ for the tripartite 3-complex generated by $O_3$ (i.e. it contains all subsets of the edges of $O_3$) and $O_3^{<} = O_3^{\le} \setminus O_3$ for the tripartite 2-complex of strict subsets of edges of $O_3$.

$^{11}$ Here we are using the notation described before the theorem: if $e$ has index some $A \subseteq [r]$, $|A| \le 3$, then $K_e(X) = K_A(X) = \prod_{i \in A} X_i$. The equation $f_e = f_eH_e$ is a concise way of saying that $f_e$ is supported on $H_e = H_A$.

Suppose $H$ is a tripartite 3-complex on $X = X_1 \cup X_2 \cup X_3$, where $H_i = X_i$, $1 \le i \le 3$. We define an 'auxiliary' tripartite 3-complex $H'$ on $X' = X_1' \cup X_2' \cup X_3'$, where $X_i'$ consists of all maps $\phi_i : A_i \to X_i$, $1 \le i \le 3$ as follows. The triples $H'_{123}$ consist of all $\phi_1\phi_2\phi_3$ for which $\phi = (\phi_1, \phi_2, \phi_3)$ is a homomorphism from $O_3$ to $H$; the pairs $H'_{ij}$ consist of all $\phi_i\phi_j$ such that $(\phi_i, \phi_j)$ is a homomorphism from $(O_3)_{ij}$ to $H_{ij}$, for $1 \le i < j \le 3$.

Theorem 3.5 Suppose $0 < 1/n \ll \eta_2 \ll \epsilon_2 \ll \eta_2' \ll d_2 \ll \eta_3 \ll \epsilon_3 \ll \eta_3' \ll \epsilon', d_3 < 1$, that $H$ is a tripartite 3-complex on $X = X_1 \cup X_2 \cup X_3$, where $H_i = X_i$ with $|X_i| > n$, $1 \le i \le 3$, that $H_{12}$, $H_{13}$ and $H_{23}$ are $\eta_2$-quasirandom with density at least $d_2$, and that $H_{123}$ is $\eta_3$-quasirandom with relative density at least $d_3$ with respect to $H$. Then, in the auxiliary complex $H'$, each $H'_{ij}$ is $\eta_2'$-quasirandom with density $(1 \pm \epsilon_2)d_{ij}(H)^4$, and $H'_{123}$ is $\eta_3'$-quasirandom with relative density $(1 \pm \epsilon_3)d_{123}(H)^8$ with respect to $H'$.

Furthermore, suppose that $G_i$ is a graph on $X_i$ with $|E(G_i)| > \epsilon'|X_i|^2$, $1 \le i \le 3$ and we randomly and independently select edges $x_ix_i'$ of $G_i$, $1 \le i \le 3$. Then $\{x_1x_1', x_2x_2', x_3x_3'\}$ spans an octahedron in $H$ with probability $(1 \pm \epsilon')d_{12}(H)^4d_{13}(H)^4d_{23}(H)^4d_{123}(H)^8$.

Proof. It may help the reader to note that the prime symbol ' is a visual cue, distinguishing quantities referring to $H'$ from those referring to $H$. We write $d_I = d_I(H)$ and $d_I' = d_I(H')$ for $I \subseteq \{1,2,3\}$, $|I| = 2,3$.$^{12}$ The quasirandomness of each $H'_{ij}$ follows from the discussion of quasirandom counting of 4-cycles in subsection 2.3, so it remains to establish the claim for $H'_{123}$. The final statement will then follow from Corollary 3.4 and equation (1).

First we estimate the relative density of $H'$ using equation (1): we have (using $\epsilon_3/3$ instead of $\epsilon$)

$$d(H'_{123}) = |H'_{123}|/|X_1|^2|X_2|^2|X_3|^2 = \mathbb{P}_{\phi \in \Phi(A,X)}[\phi(O_3^{\le}) \subseteq H] = (1 \pm \epsilon_3/3)d_{12}^4d_{13}^4d_{23}^4d_{123}^8,$$

and

$$d(H'^{*}_{123}) = |H'^{*}_{123}|/|X_1|^2|X_2|^2|X_3|^2 = \mathbb{P}_{\phi \in \Phi(A,X)}[\phi(O_3^{<}) \subseteq H] = (1 \pm \epsilon_3/3)d_{12}^4d_{13}^4d_{23}^4,$$

so $d_{123}(H') = |H'_{123}|/|H'^{*}_{123}| = (1 \pm \epsilon_3)d_{123}^8$.

Next we need to estimate $O_3(\bar{H}'_{123})$. Write $f_0' = H'_{123}$ (the characteristic function) and $f_1' = d'_{123}H'^{*}_{123}$, so that $\bar{H}'_{123} = f_0' - f_1'$. Then

$$O_3(\bar{H}'_{123}) = \mathbb{E}_{\phi' \in \Phi(A,X')} \prod_{e \in O_3} \bar{H}'_{123}(\phi'(e)) = \sum_j (-1)^{\sum j}\, \mathbb{E}_{\phi' \in \Phi(A,X')} \prod_{e \in O_3} f'_{j_e}(\phi'(e)),$$

where $j$ ranges over all $\{0,1\}$-sequences $(j_e)_{e \in O_3}$ and $\sum j = \sum_{e \in O_3} j_e$. We interpret each summand as counting homomorphisms to functions from $O_3O_3$, by which we mean (with unusual notation!) the complete tripartite 3-graph with 4 vertices in each class (cf. our earlier analysis of $C_4$ via $K_{4,4}$). To accomplish this we consider a complete tripartite 3-graph, which we call $O_3O_3$, with 3 parts $A_{i,0} \cup A_{i,1}$, $1 \le i \le 3$, where $A_{i,j} = \{a_{i,j}^0, a_{i,j}^1\}$, $1 \le i \le 3$, $j = 0,1$. We think of the edges of $O_3O_3$ as a union of 8 copies of $O_3$, indexed by $O_3$, thus $E(O_3O_3) = \bigcup_{e \in O_3} O_3^e$, where, writing $e = \{a_1^{e_1}, a_2^{e_2}, a_3^{e_3}\}$, $O_3^e$ is the copy of $O_3$ with parts $\{a_{i,e_i}^0, a_{i,e_i}^1\}$, $1 \le i \le 3$ (see Figure 1).$^{13}$

$^{12}$ To avoid confusion, note that $d_2$, $d_3$ are lower bounds for $d_{ij}$, $d_{123}$ respectively, not relative densities of $H_2$, $H_3$.

Figure 1: $O_3$ and $O_3O_3$: the left index identifies part 1, 2 or 3; the right index identifies the copy $O_3^e$ of $O_3$; the top index determines which edge we consider within a copy of $O_3$.

We identify a tripartite map $\phi'$ from $A = A_1 \cup A_2 \cup A_3$ to $X' = X_1' \cup X_2' \cup X_3'$ with a tripartite map $\phi$ from $AA = (A_{1,0} \cup A_{1,1}) \cup (A_{2,0} \cup A_{2,1}) \cup (A_{3,0} \cup A_{3,1})$ to $X$ by defining $\phi(a_{i,j}^k) = \phi'(a_i^j)(a_i^k)$. Thus for $e = \{a_1^{e_1}, a_2^{e_2}, a_3^{e_3}\} \in O_3$, $\phi'(e) = (\phi'(a_1^{e_1}), \phi'(a_2^{e_2}), \phi'(a_3^{e_3}))$ acts on $O_3^e$. We can rewrite the summands above using

$$\prod_{e \in O_3} f'_{j_e}(\phi'(e)) = \prod_{e \in O_3} \prod_{E \in O_3^e} f_{j_e}(\phi(E)) = \prod_{E \in O_3O_3} f_{j_{e(E)}}(\phi(E)),$$

where we write $e(E)$ for that $e \in O_3$ such that $E \in O_3^e$, $f_0 = H_{123}$ (the characteristic function) and

$$f_1 = (d'_{123})^{1/8}H^*_{123} = (1 \pm \epsilon_3/4)d_{123}H^*_{123}.$$

Now we can estimate the contribution from the summand corresponding to $j = (j_e)_{e \in O_3}$ using the counting lemma for homomorphisms to $H$ from a tripartite 3-complex $K_j$ on $AA$, in which the pairs form the complete tripartite graph on $AA$ and the triples are $\bigcup_{e : j_e = 0} O_3^e$. We have

$$\mathbb{E}_{\phi' \in \Phi(A,X')} \prod_{e \in O_3} f'_{j_e}(\phi'(e)) = \mathbb{E}_{\phi \in \Phi(AA,X)} \prod_{E \in O_3O_3} f_{j_{e(E)}}(\phi(E))$$
$$= (d'_{123})^{\sum j}\, \mathbb{P}_{\phi \in \Phi(AA,X)}[\phi(K_j) \subseteq H]$$
$$= ((1 \pm \epsilon_3)d_{123}^8)^{\sum j} \cdot (1 \pm \epsilon_3)d_{12}^{16}d_{13}^{16}d_{23}^{16}d_{123}^{8(8 - \sum j)}$$
$$= (1 \pm O(\epsilon_3))(d'_{12}d'_{13}d'_{23})^4(d'_{123})^8,$$

where the implicit constant in the $O(\cdot)$ notation is absolute, say 100. Therefore

$$O_3(\bar{H}'_{123}) = \sum_j (-1)^{\sum j}(1 \pm O(\epsilon_3))(d'_{12}d'_{13}d'_{23})^4(d'_{123})^8 = \sum_j O(\epsilon_3)(d'_{12}d'_{13}d'_{23})^4(d'_{123})^8 < \eta_3'(d'_{12}d'_{13}d'_{23})^4,$$

since $\epsilon_3 \ll \eta_3'$. This proves that $H'_{123}$ is $\eta_3'$-quasirandom with relative density $(1 \pm \epsilon_3)d_{123}(H)^8$ with respect to $H'$. The third statement of the theorem follows from Corollary 3.4, so we are done. □

$^{13}$ Some readers may like to think of this construction as a 'tripartite tensor product'.

Figure 2: $PG_2(2)$ and $H_2(n)$.



4 The Fano plane 

In this section we prove Theorem 1.1, thus establishing Mubayi's conjecture. Our key tool will be Theorem 3.5, our result above on uniform distribution of octahedra in quasirandom 3-complexes. We also make use of the idea of stability, or approximate structure, which can be traced back to work of Erdős and Simonovits in the 1960s in extremal graph theory. Informally stated, a stability result tells us about the structure of configurations that are close to optimal in an extremal problem: for example, a triangle-free graph with $n^2/4 - o(n^2)$ edges differs from a complete bipartite graph by $o(n^2)$ edges. Such a result is interesting in its own right, but somewhat surprisingly it is often a useful stepping stone in proving an exact result. Indeed, it was developed by Erdős and Simonovits to determine the exact Turán number for $k$-critical graphs.

To further explain our method we first need some definitions. The Fano plane $PG_2(2)$ is the projective plane over $\mathbb{F}_2 = \{0,1\}$, the field with two elements: it is a 3-graph in which the vertex set consists of the 7 non-zero vectors of length 3 over $\mathbb{F}_2$, and the 7 edges are those triples of vectors $abc$ with $a + b = c$ (see Figure 2). A natural construction of a 3-graph not containing the Fano plane is a balanced complete bipartite 3-graph $H_2(n)$, which we recall is obtained by partitioning a set into two parts $X = X_1 \cup X_2$ with sizes as equal as possible and taking as edges all triples that are not contained in either part (i.e. that intersect both parts): it is easy to verify that the Fano plane is not bipartite, so is not contained in $H_2(n)$.
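
The claim that the Fano plane is not bipartite can be verified by checking all $2^7$ two-colourings of its points. The sketch below is an illustration added here (the helper names `add` and `is_bipartite` are our own); it builds the plane from the non-zero vectors of $\mathbb{F}_2^3$ exactly as in the definition above.

```python
from itertools import combinations, product

# Points: the 7 non-zero vectors of F_2^3.
points = [v for v in product((0, 1), repeat=3) if any(v)]

def add(a, b):
    """Coordinate-wise addition over F_2."""
    return tuple((s + t) % 2 for s, t in zip(a, b))

# Lines: the triples {a, b, c} with a + b = c.
fano = {frozenset((a, b, add(a, b))) for a, b in combinations(points, 2)}

def is_bipartite(edges, vertices):
    """True if some 2-colouring of the vertices leaves no edge monochromatic."""
    for colours in product((0, 1), repeat=len(vertices)):
        colour = dict(zip(vertices, colours))
        if all(len({colour[v] for v in e}) == 2 for e in edges):
            return True
    return False

fano_is_bipartite = is_bipartite(fano, points)
```

A copy of the Fano plane inside $H_2(n)$ would give a 2-colouring of its points with no monochromatic line, so `fano_is_bipartite` being false rules this out.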

In 1976 Sós [26] conjectured that $H_2(n)$ gives the exact value of $\mathrm{ex}(n, PG_2(2))$. This was established for large $n$, together with the characterisation of $H_2(n)$ as the unique maximising configuration, independently and simultaneously by Keevash and Sudakov [13] and Füredi and Simonovits [6]. In [13] the following stability method was used: the first step was to show that a Fano-free 3-graph with $(3/4 - o(1))\binom{n}{3}$ edges differs from a complete bipartite 3-graph by $o(n^3)$ edges; then, the second step was to examine the possible imperfections in structure in a 3-graph that is close to being bipartite, showing that they exclude more edges in a Fano-free 3-graph than a complete bipartite 3-graph with no imperfections.

In considering the codegree problem, Mubayi [19] noted that the same construction $H_2(n)$ gives a lower bound $\mathrm{ex}_2(n, PG_2(2)) \ge \lfloor n/2 \rfloor$ and proved that $\pi_2(PG_2(2)) = 1/2$, so this bound is asymptotically best possible. In our solution of the exact problem, the first step will be to use Theorem 3.5 to obtain some partial structure of Fano-free 3-graphs with nearly maximal minimum codegree, namely that a Fano-free 3-graph $H$ on $n$ vertices with minimum 2-degree $(1/2 - o(1))n$ has a sparse set of size $(1/2 - o(1))n$. The second step (fairly straightforward) is to complete the approximate structure, showing that $H$ differs from a complete bipartite 3-graph by $o(n^3)$ edges. Then the final step is to examine the possible imperfections in structure to show that the largest minimum codegree is attained with no imperfections: this is similar in spirit to the analysis of the normal Turán problem, although there are some additional technical difficulties for the codegree problem. We will divide the three steps of the proof into subsections of this section.

4.1 The sparse set 

The first step of the proof is the following application of Theorem 3.5.

Theorem 4.1 Suppose $0 < 1/n \ll \theta \ll \epsilon \ll 1$, $H$ is a 3-graph on a set $X$ of $n$ vertices that does not contain a Fano plane, and $\delta_2(H) > (1/2 - \theta)n$. Then there is a set $Z \subseteq X$ with $|Z| > (1/2 - \epsilon)n$ such that $d(H[Z]) < \epsilon$, i.e. $Z$ spans at most $\epsilon|Z|^3/6$ edges of $H$.

The idea of the proof is as follows. First we apply Theorem 3.1 to decompose $H$ into induced 3-complexes, most of which are quasirandom. Then a simple double-counting gives us a vertex $x$ such that there are about half of the vertex classes in the partition on which $x$ has a dense neighbourhood graph. For each triple of such classes, each quasirandom induced 3-complex on these classes must be very sparse; otherwise Theorem 3.5 gives us an octahedron in which the pairs from each class are in the neighbourhood graph of $x$, and this structure contains a Fano plane. This implies that we have about half of the vertex classes in the partition inducing a sparse subhypergraph of $H$.

Proof of Theorem 4.1. We operate with a constant hierarchy $0 < 1/n \ll d_1 \ll \eta_2 \ll d_2 \ll \eta_3 \ll d_3, \gamma \ll \theta \ll \epsilon \ll 1$. Set r = and let $X = X_1 \cup \cdots \cup X_r$ be an arbitrary partition with $\big||X_i| - n/r\big| < 1$ for $1 \le i \le r$. Let $H_1$ be the edges of $H$ that respect this partition, i.e. have at most one point in each $X_i$. Applying Theorem 3.1 to the generated $r$-partite 3-complex $H_1^{\le}$, using $\gamma$ instead of $\epsilon$, we obtain a partition system $P$ consisting of partitions $X_i = X_i^0 \cup X_i^1 \cup \cdots \cup X_i^{n_i}$ with each $n_i \le 1/d_1$ and partitions $K(X_i, X_j) = G_{ij}^1 \cup \cdots \cup G_{ij}^{n_{ij}}$ with each $n_{ij} \le d_2^{-1/2}$, such that,

• each exceptional class $X_i^0$ has size at most $\gamma|X_i|$, for $1 \le i \le r$,

• the classes $X_i^{a_i}$, $1 \le i \le r$, $1 \le a_i \le n_i$ have the same size, say $m$, and

• if $e$ is a randomly chosen edge of $H_1$, then with probability at least $1 - \gamma$, in the induced complex $H_1^{\le}(e,P)$, the graphs are $\eta_2$-quasirandom, and the 3-graph is $\eta_3$-quasirandom with respect to $H_1^{\le}(e,P)$.

Next we find a vertex $x$ as described in the sketch before the proof. Consider the set of all pairs contained in the vertex classes of $P$: denote it $Y = \bigcup_{(i,t_i) \in I} Y_{i,t_i}$, where $I = \{(i,t_i) : 1 \le i \le r, 1 \le t_i \le n_i\}$ and $Y_{i,t_i} = \binom{X_i^{t_i}}{2}$. (Note that we are not considering any exceptional classes here.) Double-counting gives

$$\sum_{x \in X} |N_H(x) \cap Y| = \sum_{y \in Y} |N_H(y) \cap X| > |Y|(1/2 - \theta)n,$$

so we can choose $x$ with $|N_H(x) \cap Y| > (1/2 - \theta)|Y|$. Let $C = \{(i,t_i) : |N_H(x) \cap Y_{i,t_i}| > \theta|Y_{i,t_i}|\}$. Then, since $|X_i^{t_i}| = m$ for every $(i,t_i) \in I$,

$$(1/2 - \theta)|I|\binom{m}{2} = (1/2 - \theta)|Y| < |N_H(x) \cap Y| \le |C|\binom{m}{2} + |I \setminus C|\,\theta\binom{m}{2},$$

so $|C| > (1/2 - 2\theta)|I|$. Let $Z = \bigcup_{(i,t_i) \in C} X_i^{t_i}$. Then $|Z| = |C|m > (1/2 - 3\theta)n$.

Consider any induced tripartite 3-complex $H_1^{\le}(e,P)$ in which the three vertex classes are $X_{i_a}^{t_a}$, $1 \le a \le 3$ with $(i_a, t_a) \in C$, $1 \le a \le 3$ and $i_1, i_2, i_3$ are distinct. We claim that it cannot be that all three graphs in $H_1^{\le}(e,P)$ are $\eta_2$-quasirandom with density at least $d_2$ and the 3-graph in $H_1^{\le}(e,P)$ is $\eta_3$-quasirandom with density at least $d_3$. For then, since $\gamma \ll \theta$, Theorem 3.5 gives us an octahedron (in fact many!) with parts $\{x_a, x_a'\}$, $1 \le a \le 3$ in which $x_ax_a'$ is an edge of $N_H(x) \cap Y_{i_a,t_a}$, $1 \le a \le 3$. However such a structure clearly contains a Fano plane, which contradicts our assumptions.
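
One explicit way to see the Fano plane in this structure is to take the three edges through $x$ together with the four octahedron triples that use an even number of primed vertices. The Python sketch below checks this choice; it is our own illustration, and it relies on the standard fact that a system of 7 triples on 7 points in which every pair lies in exactly one triple is isomorphic to the Fano plane.

```python
from itertools import combinations, product

def is_fano(edges):
    """Check that `edges` is a Steiner triple system on 7 points.

    Up to isomorphism the Fano plane is the only such system.
    """
    edges = [frozenset(e) for e in edges]
    vertices = set().union(*edges)
    if len(vertices) != 7 or len(edges) != 7 or any(len(e) != 3 for e in edges):
        return False
    # Every pair of points must lie in exactly one triple.
    return all(
        sum(1 for e in edges if {u, v} <= e) == 1
        for u, v in combinations(vertices, 2)
    )

x = "x"
# Vertex (a, 0) stands for x_a and (a, 1) for x_a'.
# Three edges through x, one for each pair x_a x_a' in the neighbourhood graph of x.
through_x = [{x, (a, 0), (a, 1)} for a in (1, 2, 3)]
# Four of the eight octahedron triples: those with an even number of primed vertices.
even_triples = [
    {(1, e1), (2, e2), (3, e3)}
    for e1, e2, e3 in product((0, 1), repeat=3)
    if (e1 + e2 + e3) % 2 == 0
]
contains_fano = is_fano(through_x + even_triples)
```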

It follows that at least one quasirandomness condition or density condition fails for each induced tripartite 3-complex $H_1^{\le}(e,P)$ with vertex classes in $Z$. Now the choice of partition system $P$ guarantees that at most $\gamma n^3$ edges of $H_1$ belong to induced complexes which fail a quasirandomness condition. Also, at most $d_3n^3$ edges of $H_1$ belong to an induced complex for which the 3-graph has relative density at most $d_3$, and at most $3d_2^{1/2}n^3$ edges of $H_1$ belong to an induced complex in which one of the 2-graphs has relative density at most $d_2$ (since $n_{ij} \le d_2^{-1/2}$ for every $i,j$). Finally, we can estimate the number of edges of $H$ that do not respect the partition $X = X_1 \cup \cdots \cup X_r$ as $|H \setminus H_1| < n^3/r$. We conclude that $d(H[Z]) < \epsilon$, as required. □

4.2 Stability 

The remainder of the proof makes no further use of quasirandomness. The next step is to use the sparse set to deduce a stability result, i.e. that a Fano-free 3-graph with minimum codegree about $n/2$ is approximately bipartite.

Theorem 4.2 Suppose $0 < 1/n \ll \theta \ll \epsilon \ll 1$, $H$ is a 3-graph on a set $X$ of $n$ vertices that does not contain a Fano plane, and $\delta_2(H) > (1/2 - \theta)n$. Then there is a partition $X = A \cup B$ such that at most $\epsilon n^3$ edges are contained entirely within either $A$ or $B$.



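Proof. It is convenient to introduce a constant $\phi$ with $\theta \ll \phi \ll \epsilon$. By Theorem 4.1 we can find $A \subseteq X$ with $|A| > (1/2 - \phi)n$ and $d(H[A]) < \phi$, i.e. $|H[A]| < \phi|A|^3/6$. Let $B = X \setminus A$. Then

$$3|H[A]| + \sum_{p \in \binom{A}{2}} |N_H(p) \cap B| = \sum_{p \in \binom{A}{2}} |N_H(p)| > \binom{|A|}{2}(1/2 - \theta)n,$$

so

$$\sum_{b \in B} \Big|N_H(b) \cap \binom{A}{2}\Big| = \sum_{p \in \binom{A}{2}} |N_H(p) \cap B| > (1/2 - \theta)n\binom{|A|}{2} - \phi|A|^3/2 > (1/2 - 2\phi)n\binom{|A|}{2}.$$

This implies that $|B| > (1/2 - 2\phi)n$, so both $|A|$ and $|B|$ are $(1/2 \pm 2\phi)n$. Let

$$B_0 = \Big\{ b \in B : \Big|N_H(b) \cap \binom{A}{2}\Big| < (1 - \phi^{1/2})\binom{|A|}{2} \Big\}.$$

Then

$$(1/2 - 2\phi)n\binom{|A|}{2} < \sum_{b \in B} \Big|N_H(b) \cap \binom{A}{2}\Big| \le |B_0|(1 - \phi^{1/2})\binom{|A|}{2} + |B \setminus B_0|\binom{|A|}{2} = (|B| - \phi^{1/2}|B_0|)\binom{|A|}{2},$$

so $|B_0| < \phi^{-1/2}(|B| - (1/2 - 2\phi)n) < 3\phi^{1/2}n$.

We claim that there is no edge $e \in H[B \setminus B_0]$. Otherwise we would have

$$\Big| \bigcap_{b \in e} N_H(b) \cap \binom{A}{2} \Big| > (1 - 3\phi^{1/2})\binom{|A|}{2}.$$

Then, since $\phi \ll 1$, $\bigcap_{b \in e} N_H(b) \cap \binom{A}{2}$ contains a copy of $K_4$, the complete graph on 4 vertices: we can see this by Turán's theorem, which gives $\pi(K_4) = 2/3$, or even just a simple averaging argument which gives $\pi(K_4) \le 5/6$. However, this $K_4$ and $e$ span a copy of the Fano plane, contradiction. Therefore every edge of $B$ contains a point of $B_0$, so $|H[B]| < |B_0||B|^2 < 3\phi^{1/2}n^3$. Since $|H[A]| < \phi|A|^3/6$ there are less than $\epsilon n^3$ edges contained entirely within either $A$ or $B$. □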
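
The step "this $K_4$ and $e$ span a copy of the Fano plane" can be made explicit: the six pairs of $K_4$ split into three perfect matchings, and each vertex of $e$ is joined to one of them. The following Python sketch (our own illustration; the labelling by vectors of $\mathbb{F}_2^3$ is an assumption chosen for convenience) rebuilds the Fano plane in exactly this way and compares it with the definition.

```python
from itertools import combinations, product

def xor(a, b):
    """Coordinate-wise addition over F_2."""
    return tuple(s ^ t for s, t in zip(a, b))

points = [v for v in product((0, 1), repeat=3) if any(v)]
fano = {frozenset((a, b, xor(a, b))) for a, b in combinations(points, 2)}

# The edge e = b1 b2 b3 is one line; the other four points play the role of K_4.
e = frozenset({(1, 0, 0), (0, 1, 0), (1, 1, 0)})
k4 = [p for p in points if p not in e]

# Each b in e is joined to the perfect matching of K_4 whose pairs differ by b.
built = {e}
for b in e:
    for u, v in combinations(k4, 2):
        if xor(u, v) == b:
            built.add(frozenset({b, u, v}))

# The pairs of K_4 used by the six edges other than e.
k4_pairs_used = {frozenset(t - e) for t in built if t != e}
```

In the proof every pair of the $K_4$ lies in the neighbourhood of every vertex of $e$, so all six of these triples are edges of $H$ whichever matching is assigned to which vertex.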



4.3 The exact codegree result for the Fano plane 

Finally we use the stability result above to prove Theorem 1.1, which states that if $n$ is sufficiently large and $H$ is a 3-graph on $n$ vertices with minimum 2-degree at least $n/2$ that does not contain a Fano plane, then $n$ is even and $H$ is a balanced complete bipartite 3-graph.

Proof of Theorem 1.1. Let $\epsilon$ be sufficiently small and $n > n_0(\epsilon)$ sufficiently large. By Theorem 4.2 we have a partition $X = X_0 \cup X_1$ so that $|H[X_0]| + |H[X_1]| < \epsilon n^3$. Choose the partition that minimises $|H[X_0]| + |H[X_1]|$. We will show that this partition satisfies the conclusion of the theorem.

Note first that the same argument used in the proof of Theorem 4.2 shows that $|X_0|$ and $|X_1|$ are $(1/2 \pm 2\epsilon)n$.

Next we show that there is no vertex $x \in X_0$ with degree at least $\epsilon^{1/4}n^2$ in $H[X_0]$. For suppose there is such a vertex $x$. By choice of partition we have $|N_H(x) \cap \binom{X_1}{2}| \ge |N_H(x) \cap \binom{X_0}{2}| \ge \epsilon^{1/4}n^2$, or we could reduce $|H[X_0]| + |H[X_1]|$ by moving $x$ to $X_1$. We can choose matchings $M = \{x_1^1x_1^2, \cdots, x_m^1x_m^2\}$ in $N_H(x) \cap \binom{X_0}{2}$ and $M' = \{y_1^1y_1^2, \cdots, y_m^1y_m^2\}$ in $N_H(x) \cap \binom{X_1}{2}$, with $m = \epsilon^{1/4}n/2$; indeed, it is a well-known observation that any maximal matchings will be at least this large, as the vertex set of a maximal matching in a graph covers all of its edges. Now we use an averaging argument to find a pair of edges of $M$ that are not 'traversed' by many edges of $H$, in the following sense. We have

$$\sum_{I = \{i_1,i_2\} \in \binom{[m]}{2}} \sum_{J = (j_1,j_2) \in \{1,2\}^2} |N_{H[X_0]}(\{x_{i_1}^{j_1}, x_{i_2}^{j_2}\})| \le 3|H[X_0]| < 3\epsilon n^3,$$

so we can choose $I \in \binom{[m]}{2}$ such that

$$\sum_{J = (j_1,j_2) \in \{1,2\}^2} |N_{H[X_0]}(\{x_{i_1}^{j_1}, x_{i_2}^{j_2}\})| < \binom{m}{2}^{-1} 3\epsilon n^3 < 50\epsilon^{1/2}n.$$

Also, for every $1 \le k \le m$, $H$ cannot contain all 8 edges $x_{i_1}^{j_1}x_{i_2}^{j_2}y_k^{j_3}$, with $j_1, j_2, j_3$ in $\{1,2\}$, as then together with $x$ we would have a copy of the Fano plane. By double-counting we deduce that there is some pair $p = x_{i_1}^{j_1}x_{i_2}^{j_2}$ such that there are at least $m/4$ vertices $y_k^{j_3}$ that do not belong to $N_H(p)$. This gives $|N_H(p)| < 50\epsilon^{1/2}n + |X_1| - \epsilon^{1/4}n/8 < (1/2 - \epsilon)n$, which contradicts our assumptions. We deduce that there is no vertex in $X_0$ with degree at least $\epsilon^{1/4}n^2$ in $H[X_0]$. Similarly there is no vertex in $X_1$ with degree at least $\epsilon^{1/4}n^2$ in $H[X_1]$.

Write $|X_0| = n/2 + t$ and $|X_1| = n/2 - t$, where without loss of generality $0 \le t \le 2\epsilon n$. Suppose for a contradiction that either $t > 0$ or $t = 0$ and there is an edge in $H[X_0]$ or $H[X_1]$. Note that for every pair $p \in \binom{X_0}{2}$ we have $|N_H(p) \cap X_0| \ge |N_H(p)| - |X_1| \ge t$. Thus we can assume there is at least one edge in $X_0$ (since the case $t = 0$ is symmetrical). Let $Y_i \subseteq X_i$ be minimum size transversals of $H[X_i]$ (i.e. $|e \cap Y_i| \ne 0$ for every $e$ in $H[X_i]$). Then $Y_0 \ne \emptyset$. Also, by the previous paragraph

$$|Y_0|\epsilon^{1/4}n^2 > \sum_{x \in Y_0} |N_{H[X_0]}(x)| \ge |H[X_0]| = \frac{1}{3}\sum_{p \in \binom{X_0}{2}} |N_{H[X_0]}(p)| \ge \frac{t}{3}\binom{|X_0|}{2},$$

so $|Y_0| > \epsilon^{-1/4}\frac{t}{3}n^{-2}\binom{n/2 + t}{2} > 50t$, say, since $\epsilon$ is small.

For each edge $e \in H[X_0]$ consider all possible ways to extend it to a copy of the Fano plane using some $F \in \binom{X_1}{4}$. Since $H$ does not contain a Fano plane there is some triple with 2 points in $F$ and one point in $e$ that is not an edge. We count each such triple at most $\binom{|X_1| - 2}{2}$ times, so we get a set of at least $\binom{|X_1| - 2}{2}^{-1}\binom{|X_1|}{4}$ distinct triples. Thus there is some point $x$ in $e$ for which we have a set $M_x$ of at least $\frac{1}{3}\binom{|X_1| - 2}{2}^{-1}\binom{|X_1|}{4} > \frac{1}{20}\binom{|X_1|}{2}$ 'missing' triples involving $x$ and a pair in $\binom{X_1}{2}$ not belonging to $N_H(x)$. Now varying $e$ over all edges in $X_0$, we get sets $M_x$, $x \in T$ of at least $\frac{1}{20}\binom{|X_1|}{2}$ 'missing' triples involving $x$, for some transversal $T$. Since $Y_0$ is a minimum size transversal, $M = \bigcup_{x \in T} M_x$ has size at least $|M| \ge \frac{1}{20}|Y_0|\binom{|X_1|}{2}$.

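Now, for each pair $p \in \binom{X_1}{2}$ we have

$$n/2 \le |N_H(p)| = |N_H(p) \cap X_0| + |N_H(p) \cap X_1| \le |X_0| - |N_M(p)| + |N_H(p) \cap X_1|,$$

so

$$3|H[X_1]| = \sum_{p \in \binom{X_1}{2}} |N_H(p) \cap X_1| \ge \sum_{p \in \binom{X_1}{2}} (|N_M(p)| - t) = |M| - t\binom{|X_1|}{2}.$$

Since $|M| \ge \frac{1}{20}|Y_0|\binom{|X_1|}{2}$ and $\frac{1}{20}|Y_0| - t > \frac{1}{40}|Y_0|$ we have $|H[X_1]| > \frac{1}{120}|Y_0|\binom{|X_1|}{2}$. Also $|Y_1|\epsilon^{1/4}n^2 > \sum_{x \in Y_1} |N_{H[X_1]}(x)| \ge |H[X_1]|$, so $|Y_1| > \epsilon^{-1/4}\frac{1}{120}|Y_0|n^{-2}\binom{n/2 - t}{2} > 2|Y_0|$, say, since $\epsilon$ is small.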

Finally we can apply the argument of the previous paragraph interchanging $X_0$ and $X_1$. We get a set $M'$ of at least $\frac{1}{20}|Y_1|\binom{|X_0|}{2}$ distinct triples that are not edges, each having 2 points in $X_0$ and 1 point in $X_1$. For each pair $p \subseteq X_0$ we now have $|N_H(p) \cap X_0| \ge |N_{M'}(p)| + t$, so $|H[X_0]| \ge \frac{1}{3}(|Y_1|/20 + t)\binom{|X_0|}{2}$ and $|Y_0| > \epsilon^{-1/4}n^{-2}|H[X_0]| > 2|Y_1|$. This contradiction completes the proof. □



5 Quasirandom hypergraphs 

We now return to the theory of quasirandom hypergraphs: we will discuss the Gowers approach in full generality, together with some variants that we need for our arguments, the general form of our quasirandom counting lemma, and its application to generalised Turán problems. The essential ideas are already present in the discussion above for graphs and 3-graphs, so we will be fairly brief. We refer the reader to [8] for full details of the Gowers theory, with the proviso that we have also adopted some notation and terminology from [9] and made other changes for consistency of notation. This section is divided into four subsections: the first extends the notation and definitions introduced above to general $k$-graphs, the second contains the general forms of the decomposition theorem and counting lemma, the third the general form of our quasirandom counting lemma, and the fourth a generalised form of Theorem 4.1.

5.1 Definitions 

Consider a set $X = X_1 \cup \cdots \cup X_r$ partitioned into $r$ parts. For brevity call this an $r$-partite set. $A \subseteq X$ is an $r$-partite subset if it contains at most one point from each $X_i$. An $r$-partite $k$-graph on $X$ is a $k$-graph on $X$ consisting of $r$-partite subsets. An $r$-partite $k$-complex $H$ is a collection of $r$-partite subsets of $X$ of size at most $k$ that forms a simplicial complex, i.e. if $S \in H$ and $T \subseteq S$ then $T \in H$. If $H$ is a $k$-graph then we can generate a $k$-complex $H^{\le} = \{T : \exists S \in H, T \subseteq S\}$ and a $(k-1)$-complex $H^{<} = \{T : \exists S \in H, T \subset S\}$ (strict subsets). If $H$ is an $r$-partite $k$-complex on $X = X_1 \cup \cdots \cup X_r$ and $A \in \binom{[r]}{\le k}$ we define an $|A|$-complex $H_{A^{\le}} = \bigcup_{B \subseteq A} H_B$ and an $(|A|-1)$-complex $H_{A^{<}} = \bigcup_{B \subset A} H_B$.

The index of an $r$-partite subset $A$ of $X$ is $i(A) = \{i \in [r] : A \cap X_i \ne \emptyset\}$. Let $K_A(X) = \{S \subseteq X : i(S) = i(A)\}$. If $H$ is a $k$-graph or $k$-complex we write $H_A$ for the collection of sets in $H$ of index $A$. In particular, if $S$ is an $r$-partite subset of $X$ and $A \subseteq i(S)$ then $S_A = S \cap \bigcup_{i \in A} X_i$. We also use $H_A : K_A(X) \to \{0, 1\}$ to denote the characteristic function of this set, i.e. $H_A(S)$ is 1 if $S \in H_A$ and 0 otherwise.

Write $H^*_A$ for the collection of sets $S$ of index $A$ such that all proper subsets of $S$ belong to $H$. (Note that $H_A \subseteq H^*_A$.) We also use $H^*_A$ to denote the characteristic function of this set. The relative $A$-density of $H$ is $d_A(H) = |H_A|/|H^*_A|$. We shorten this to $d_A$ if $H$ is clear from the context. In particular we have $d_\emptyset = 1$, since $H_\emptyset = H^*_\emptyset = \{\emptyset\}$. We often assume that $H_{\{i\}} = X_i$, so that $d_{\{i\}} = 1$ for $1 \le i \le r$: in general we can apply a result obtained under this assumption by replacing $X = X_1 \cup \cdots \cup X_r$ with $X' = X'_1 \cup \cdots \cup X'_r$, where $X'_i = H_{\{i\}}$. We also use an unsubscripted $d$ to denote (absolute) density, e.g. $d(H_A) = |H_A|/|K_A(X)|$.

Suppose $Y = Y_1 \cup \cdots \cup Y_r$ is another $r$-partite set. We let $\Phi(Y, X)$ denote the set of all $r$-partite maps from $Y$ to $X$: these are maps $\phi : Y \to X$ such that $\phi(Y_i) \subseteq X_i$ for each $i$. If $J$ is a $k$-graph or $k$-complex on $Y$ and $H$ a $k$-graph or $k$-complex on $X$ we say that $\phi$ is a homomorphism if $\phi(J) \subseteq H$.

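For each number $i$ let $U_i = \{u_i^0, u_i^1\}$ be a set of size 2 and let $U = \bigcup_i U_i$. For a set of numbers $A$ let $U_A = \bigcup_{i \in A} U_i$. The $A$-octahedron on $U$ is $O_A = \{B \subseteq U_A : |B \cap U_i| = 1, i \in A\}$. Suppose that for each $B \in O_A$ we have functions $f_B$ defined on $K_A(X)$. Their Gowers inner product is $\langle \{f_B\}_{B \in O_A} \rangle_{\square^d} = \mathbb{E}_{\omega \in \Phi(U,X)} \prod_{B \in O_A} f_B(\omega(B))$, where $d = |A|$. (Effectively we are averaging over $\omega$ in $\Phi(U_A, X_A)$, but extending the function does not affect the average.) Given a function $f$ defined on $K_A(X)$ let $\mathrm{Oct}(f) = \langle \{f_B\}_{B \in O_A} \rangle_{\square^d}$ where $f_B = f$ for every $B \in O_A$. Say that $f$ is $\eta$-quasirandom relative to $H$ if (see footnote 14)

\[ \mathrm{Oct}(f) < \eta \prod_{B \in O_{A^{<}}} d_B(H). \]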
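As a concrete illustration of the octahedral average, the following Python sketch computes $\mathrm{Oct}(f)$ in the simplest case $|A| = 2$, where $K_A(X) = X_1 \times X_2$ and the average runs over $x_0, x_1 \in X_1$ and $y_0, y_1 \in X_2$. The function names `oct2` and `balanced` and the two example graphs are illustrative assumptions of ours, not taken from [8]; we also assume that every vertex is present, so the product of relative densities on the right-hand side of the definition equals 1.

```python
from itertools import product

def oct2(f, X1, X2):
    """Oct(f) for |A| = 2: the average over x0, x1 in X1 and y0, y1 in X2
    of f(x0,y0) * f(x0,y1) * f(x1,y0) * f(x1,y1)."""
    total = 0.0
    for x0, x1 in product(X1, repeat=2):
        for y0, y1 in product(X2, repeat=2):
            total += f(x0, y0) * f(x0, y1) * f(x1, y0) * f(x1, y1)
    return total / (len(X1) ** 2 * len(X2) ** 2)

def balanced(edges, X1, X2):
    """Balanced function of a bipartite graph: indicator minus edge density."""
    d = len(edges) / (len(X1) * len(X2))
    return lambda x, y: (1.0 if (x, y) in edges else 0.0) - d

X1 = X2 = range(8)
# 'Parity' graph: x ~ y when the bitwise inner product of x and y is even.
parity = {(x, y) for x in X1 for y in X2 if bin(x & y).count("1") % 2 == 0}
# Structured graph: two disjoint complete bipartite blocks.
blocks = {(x, y) for x in X1 for y in X2 if (x < 4) == (y < 4)}

oct_parity = oct2(balanced(parity, X1, X2), X1, X2)
oct_blocks = oct2(balanced(blocks, X1, X2), X1, X2)
print(oct_parity, oct_blocks)
```

In this toy example the balanced function of the block graph has octahedral average exactly $1/16$, while that of the parity graph is below $0.01$, so only the latter would pass as $\eta$-quasirandom for a small $\eta$.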
If $H$ is a $k$-complex on $X$ and $A \in \binom{[r]}{\le k} = \{B : B \subseteq [r], |B| \le k\}$, the balanced function of $H_A$ is $\overline{H}_A = (H_A - d_A)H^*_A = H_A - d_A H^*_A$. We say that $H_A$ is $\eta$-quasirandom if its balanced function $\overline{H}_A$ is $\eta$-quasirandom. If $F$ is a fixed $r$-partite $k$-complex we say that $H$ is $(\varepsilon, F, k)$-quasirandom if $H_A$ is $\eta_{|A|}$-quasirandom for every $A \in F$, where the 'hidden parameters' $\eta_2, \cdots, \eta_k$ are defined recursively
by e k = e, and r] s = \(e s Y\ A eF,\A\>s dA) 2 * , e s -i = ^l^l" 1 IlLs 2 ~* for 2 < s < /c. This terminology 
is a convenient way of expressing a sufficient condition for approximate counting of homomorphisms from $F$ to $H$. It is sometimes helpful to have a notation including the hidden parameters, thus we say that $H$ is $\eta$-quasirandom, where $\eta = (\eta_1, \cdots, \eta_k)$, if $H_A$ is $\eta_{|A|}$-quasirandom with respect to $H$ for every $A \in \binom{[r]}{\le k}$ (we can take $\eta_1 = 0$), and we say that $H$ is $d$-dense, where $d = (d_1, \cdots, d_k)$, if $H_A$ has relative density at least $d_{|A|}$ with respect to $H$ for every $A \in \binom{[r]}{\le k}$. We also say that $H$ is $(F, d)$-dense, if $H_A$ has relative density at least $d_{|A|}$ with respect to $H$ for every $A \in F$. There is a natural hierarchy $0 \le \eta_1 \ll d_1 \ll \eta_2 \ll \varepsilon_2 \ll d_2 \ll \cdots \ll \eta_k \ll \varepsilon_k \ll d_k, |F|^{-1}$ that arises in applications, and assuming that the functions implicit in the $\ll$-notation decay sufficiently quickly, if $H$ is $\eta$-quasirandom and $(F, d)$-dense then $H_{A^{\le}}$ is $(\varepsilon_{|A|}, F_{A^{\le}}, |A|)$-quasirandom for every $A \in \binom{[r]}{\le k}$.

Comments. We prefer the term $k$-complex ($k$-chain is used in [8]), as we are dealing with simplicial complexes. We are using the letter $d$ for densities ($\delta$ is used in [8]) to reserve the letter $\delta$ for minimum degree or density. We should emphasise that $d$ with a set subscript indicates relative density with respect to that set, whereas $d$ with no subscript indicates (absolute) density (which will be generally well-approximated by a product of relative densities). For the sake of consistency we always use set subscripts to indicate some kind of restriction to that set. The functions defined on $K_A(X)$ may be regarded as $A$-functions as defined in [8]. The Gowers inner product was formalised in [9] in a slightly restricted context and generally in [29], where it was called the cube inner product. Here we are departing from [8], where a counting rather than averaging convention is used, as we find it more convenient not to have to keep track of normalising factors. Although we do not need these facts, it may aid the reader to know that for $d \ge 2$ the operation $f \mapsto \|f\|_{\square^d} = \mathrm{Oct}(f)^{2^{-d}}$ defines a norm, and there is a Gowers-Cauchy-Schwarz inequality $|\langle \{f_B\}_{B \in O_A} \rangle_{\square^d}| \le \prod_{B \in O_A} \|f_B\|_{\square^d}$.

14 Recall that $O_{A^{<}}$ is a $(k-1)$-complex consisting of strict subsets of the sets in $O_A$, and write $d_B(H) = d_{i(B)}(H)$ for less cumbersome notation. This will be a general rule: a subscript $A$ should be understood as $i(A)$ where appropriate.

5.2 Quasirandom decomposition and the counting lemma 

As for 3-graphs, the above notion of quasirandomness admits a decomposition theorem and a counting 
lemma. 

Suppose, as before, that $X = X_1 \cup \cdots \cup X_r$ is an $r$-partite set. A partition $k$-system $P$ is a collection of partitions $P_A$ of $K_A(X)$ for every $A \in \binom{[r]}{\le k}$. For each $A$ there is a natural refinement of $P_A$, called strong equivalence in [8], where sets $S, S'$ in $K_A(X)$ are strongly equivalent if $S_B$ and $S'_B$ belong to the same class of $P_B$ for every $B \subseteq A$ (see footnote 15). Given $x \in K_{[r]}(X)$ the induced complex $P(x)$ has maximal edges equal to all sets strongly equivalent to some $x_A = \{x_i : i \in A\}$ with $|A| = k$. The main case of interest is when $H$ is a $k$-graph or $k$-complex and $P_A = \{H_A, K_A(X) \setminus H_A\}$.

The following decomposition theorem is a variant of Theorem 7.3 in [8], with two key differences: (i) we have an equitable partition of $X$, together with an exceptional class, and (ii) we want some flexibility in the choice of the hidden parameters in quasirandomness, rather than the specific choice inherent in the definition of $(\varepsilon, F, k)$-quasirandomness. The proof is very similar to that given in [8], so we will defer a sketch of the necessary modifications that give this version to Appendix A.

Theorem 5.1 Suppose $0 < 1/n \ll d_1 \ll \eta_2 \ll d_2 \ll \eta_3 \ll \cdots \ll d_{k-1} \ll \eta_k \ll \varepsilon, 1/r, 1/m < 1$, that $X = X_1 \cup \cdots \cup X_r$ is an $r$-partite set with each $|X_i| \ge n$, and $P$ is a partition $k$-system on $X$ such that each $P_A$ partitions $K_A(X)$ into at most $m$ sets. Then there is a partition $k$-system $Q$ refining $P$ such that

• $Q_A = P_A$ when $|A| = k$,

• $|Q_A| \le d_{|A|}^{-1/2}$ when $|A| < k$,

• $Q_i = Q_{\{i\}}$, $1 \le i \le r$ are of the form $X_i^{0,1} \cup \cdots \cup X_i^{0,m_{i,0}} \cup X_i^1 \cup \cdots \cup X_i^{m_i}$, where $X_i^0 = X_i^{0,1} \cup \cdots \cup X_i^{0,m_{i,0}}$ are exceptional classes of size $|X_i^0| < \varepsilon|X_i|$, and the $X_i^t$, $1 \le i \le r$, $t \ne 0$ are all of equal size, and

• if $x = (x_1, \cdots, x_r)$ is chosen uniformly at random from $K_{[r]}(X)$, then with probability at least $1 - \varepsilon$ the induced complex $Q(x)$ is $\eta$-quasirandom and $d$-dense, for some $0 < d_k < 1$.

Next, we have the following generalised counting lemma, the adaptation of Theorem 5.1 in [8] to the setting of $\eta$-quasirandom $d$-dense complexes. (The proof is identical.)

15 Recall that set subscripts indicate an appropriate restriction: $S_B = S \cap \bigcup_{i \in B} X_i$.
Theorem 5.2 Suppose $0 < 1/n \ll d_1 \ll \eta_2 \ll d_2 \ll \eta_3 \ll \cdots \ll d_{k-1} \ll \eta_k \ll \varepsilon, d_k, 1/|F| < 1$, $H$ and $F$ are $r$-partite $k$-complexes on $X = X_1 \cup \cdots \cup X_r$ and $Y = Y_1 \cup \cdots \cup Y_r$ respectively, and $H$ is $\eta$-quasirandom and $(F, d)$-dense. Suppose $F_0$ is a subcomplex of $F$, for each $e \in F$ we have a function $f_e$ on $K_e(X)$ with $f_e = f_e H_e$, and $f_e = H_e$ for all $e \in F \setminus F_0$. Then

\[ \mathbb{E}_{\phi \in \Phi(Y,X)} \Big[ \prod_{e \in F} f_e(\phi(e)) \Big] = \mathbb{E}_{\phi \in \Phi(Y,X)} \Big[ \prod_{e \in F_0} f_e(\phi(e)) \Big] \prod_{e \in F \setminus F_0} d_e(H) \pm \varepsilon \prod_{e \in F} d_e(H). \]



As for 3-graphs, there are two special cases of Theorem 5.2 that are particularly useful. Firstly, when $F_0 = \emptyset$ we get a counting lemma for the partite homomorphism density of $F$ in $H$:

\[ d_F(H) = \mathbb{E}_{\phi \in \Phi(Y,X)} \Big[ \prod_{e \in F} H_e(\phi(e)) \Big] = (1 \pm \varepsilon) \prod_{e \in F} d_e(H). \]

The further special case when $F$ is a single edge gives an approximation of absolute densities in terms of relative densities:

\[ d(H_A) = (1 \pm \varepsilon) \prod_{B \subseteq A} d_B(H). \]



Secondly, we see that quasirandomness implies vertex-uniformity (the proof is almost identical to that given for 3-graphs, so we omit it):

Corollary 5.3 Suppose $0 < 1/n \ll d_1 \ll \eta_2 \ll d_2 \ll \eta_3 \ll \cdots \ll d_{k-1} \ll \eta_k \ll \varepsilon, d_k < 1$ and $H$ is an $\eta$-quasirandom $d$-dense $k$-partite $k$-complex on $X = X_1 \cup \cdots \cup X_k$, with $H_i = X_i$, $1 \le i \le k$. Suppose $W_i \subseteq X_i$, $1 \le i \le k$ and let $H_{[k]}[W]$ be the restriction of $H_{[k]}$ to $W = W_1 \cup \cdots \cup W_k$. Then

\[ \frac{|H_{[k]}[W]|}{|X_1| \cdots |X_k|} = \frac{|H_{[k]}|}{|X_1| \cdots |X_k|} \left( \frac{|W_1| \cdots |W_k|}{|X_1| \cdots |X_k|} \pm \varepsilon \right). \]



5.3 The homomorphism complex: quasirandom counting 

Now we will present our quasirandom version of the counting lemma in full generality. Suppose $J$ is a $t$-partite $k$-complex with vertex set $E = E_1 \cup \cdots \cup E_t$ and $G$ is a $t$-partite $k$-complex with vertex set $Y = Y_1 \cup \cdots \cup Y_t$. We define the homomorphism complex $J \to G$, which we also denote $G'$ for the sake of compact notation. The vertex set is $Y' = Y'_1 \cup \cdots \cup Y'_t$, where $Y'_i$ is the set of all maps $\phi_i : E_i \to Y_i$. For each $A \subseteq [t]$, the edges $G'_A$ of index $A$ consist of all $|A|$-tuples $(\phi_i)_{i \in A}$ for which the associated $|A|$-partite map $\phi_A : E_A \to Y_A$ is a homomorphism, i.e. $\phi_A(J_A) \subseteq G_A$. Note that $G' = J \to G$ is formally a $t$-complex, but its sets of size bigger than $k$ have a trivial structure, in that they are present exactly when all their subsets of size (at most) $k$ are present.

Theorem 5.4 Suppose $0 < 1/n \ll \eta_2 \ll \varepsilon_2 \ll \eta'_2 \ll d_2 \ll \eta_3 \ll \varepsilon_3 \ll \eta'_3 \ll \cdots \ll \eta_k \ll \varepsilon_k \ll \eta'_k \ll 1/t, d_k < 1$, that $J$ is a $t$-partite $k$-complex on $E = E_1 \cup \cdots \cup E_t$ and $G$ is an $\eta$-quasirandom $(J, d)$-dense $t$-partite $k$-complex on $Y = Y_1 \cup \cdots \cup Y_t$, where $G_i = Y_i$, $1 \le i \le t$. Then $G' = J \to G$ is $\eta'$-quasirandom, where $\eta' = (\eta'_2, \cdots, \eta'_t)$ with $\eta'_i = 0$ for $k < i \le t$. Furthermore, the densities may be estimated as $d_A(G') = (1 \pm \varepsilon_{|A|})d_A(G)^{|J_A|}$ for $|A| \le k$ and $d_A(G') = 1$ for $|A| > k$.






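Proof. The argument is very similar to that used in the proof of Theorem 3.3. We write $d_A = d_A(G)$ and $d'_A = d_A(G')$ for $A \subseteq [t]$, $|A| \ge 2$. Fix $A \subseteq [t]$, $|A| \ge 2$. The counting lemma gives

\[ d'_A = \frac{|G'_A|}{|(G')^*_A|} = \frac{\mathbb{P}_{\phi \in \Phi(E,Y)}(\phi(J_{A^{\le}}) \subseteq G)}{\mathbb{P}_{\phi \in \Phi(E,Y)}(\phi(J_{A^{<}}) \subseteq G)} = \frac{(1 \pm \varepsilon_{|A|}/3) \prod_{S \in J_{A^{\le}}} d_S}{(1 \pm \varepsilon_{|A|}/3) \prod_{S \in J_{A^{<}}} d_S} = (1 \pm \varepsilon_{|A|})\, d_A^{|J_A|}. \]

Next we need to estimate $\mathrm{Oct}(\overline{G}'_A)$. Write $\overline{G}'_A = g'_0 - g'_1$, where $g'_0 = G'_A$ and $g'_1 = d'_A (G')^*_A$. We can assume that $|A| \le k$: otherwise we have $G'_A = (G')^*_A$, $d'_A = 1$ and $\overline{G}'_A = 0$. Then

\[ \mathrm{Oct}(\overline{G}'_A) = \mathbb{E}_{\omega \in \Phi(U,Y')} \prod_{B \in O_A} \overline{G}'_A(\omega(B)) = \sum_j (-1)^{\sum j}\, \mathbb{E}_{\omega \in \Phi(U,Y')} \prod_{B \in O_A} g'_{j_B}(\omega(B)), \]

where we recall that $O_A$ is the $A$-octahedron on $U$, $j$ ranges over all $\{0,1\}$-sequences $(j_B)_{B \in O_A}$ and $\sum j = \sum_{B \in O_A} j_B$.

We interpret each summand as counting homomorphisms to functions from $JJ_A$, where we define the clone $JJ$ of $J$ as the following $t$-partite $k$-complex on $EE = EE_1 \cup \cdots \cup EE_t$. The parts $EE_i = E_i^0 \cup E_i^1$, $1 \le i \le t$ have two copies $x_i^0, x_i^1$ of each point $x_i \in E_i$. The edges of $JJ_A$ consist of $2^{|A|}$ copies of $J_A$, indexed by $O_A$, thus $JJ_A = \bigcup_{B \in O_A} J_A^B$, writing $J_A^B = \{\{x_i^B\}_{i \in A} : \{x_i\}_{i \in A} \in J_A\}$, where $x_i^B$ is $x_i^0$ if $B \cap U_i = \{u_i^0\}$ or $x_i^1$ if $B \cap U_i = \{u_i^1\}$. Given a $t$-partite map $\omega$ from $U$ to $Y'$, we obtain $t$-partite maps $\phi^B = \omega(B)$, $B \in O_A$ acting on various copies of $E$ in $EE$, which together give a $t$-partite map $\phi$ from $EE$ to $Y$ defined by $\phi(x_i^j) = \omega(u_i^j)(x_i)$. Now we can rewrite the summands above using

\[ \prod_{B \in O_A} g'_{j_B}(\omega(B)) = \prod_{B \in O_A} \prod_{T \in J_A^B} g_{j_B}(\phi(T)) = \prod_{T \in JJ_A} g_{j_{B(T)}}(\phi(T)), \]

where we write $B(T)$ for that $B \in O_A$ such that $T \in J_A^B$, $g_0 = G_A$ (the characteristic function) and $g_1 = (d'_A)^{1/|J_A|} G^*_A = (1 \pm 2|J_A|^{-1}\varepsilon_{|A|})\, d_A G^*_A$.

Now we can estimate the contribution from the summand corresponding to $j = (j_B)_{B \in O_A}$ using the counting lemma for homomorphisms to $G$ from the complex $K_j = JJ_{A^{\le}} \setminus \bigcup_{B : j_B = 1} J_A^B$. We have

\[ \mathbb{E}_{\omega \in \Phi(U,Y')} \prod_{B \in O_A} g'_{j_B}(\omega(B)) = \mathbb{E}_{\phi \in \Phi(EE,Y)} \prod_{T \in JJ_A} g_{j_{B(T)}}(\phi(T)) = (d'_A)^{\sum j}\, \mathbb{E}_{\phi \in \Phi(EE,Y)} 1[\phi(K_j) \subseteq G] \]
\[ = \big((1 \pm \varepsilon_{|A|})\, d_A^{|J_A|}\big)^{\sum j} \cdot (1 \pm \varepsilon_{|A|}) \prod_{T \in K_j} d_T(G) = (1 \pm O(\varepsilon_{|A|})) \prod_{T \in JJ_{A^{\le}}} d_T(G) = (1 \pm O(\varepsilon_{|A|})) \prod_{B \in O_{A^{\le}}} d'_B. \]

Therefore

\[ \mathrm{Oct}(\overline{G}'_A) = \sum_j (-1)^{\sum j} (1 \pm O(\varepsilon_{|A|})) \prod_{B \in O_{A^{\le}}} d'_B = O(\varepsilon_{|A|}) \prod_{B \in O_{A^{\le}}} d'_B < \eta'_{|A|} \prod_{B \in O_{A^{<}}} d'_B, \]

since $\varepsilon_{|A|} \ll \eta'_{|A|}$. This proves that $G'_A$ is $\eta'_{|A|}$-quasirandom, so we are done. □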
5.4 An application to generalised Turan problems

In this subsection we apply the quasirandom counting lemma to derive information about generalised Turan problems for configurations that have the following particular structure. Suppose $F$ is a $k$-graph and $s$ is an integer. The $s$-blowup $F(s)$ of $F$ is defined as follows. For each vertex $x$ of $F$ there are vertices $x_1, \cdots, x_s$ of $F(s)$. For each edge $x^1 \cdots x^k$ of $F$ we have all $s^k$ edges $x^1_{i_1} \cdots x^k_{i_k}$ with $1 \le i_1, \cdots, i_k \le s$ in $F(s)$. Note that $F(2) = FF$ is the clone of $F$ as defined in the previous section.

For $1 \le s \le k - 1$ we define the $s$-augmentation $F^{+s}$ of $F$ as the $k$-graph obtained from $F(s)$ by adding a set $V^+$ of $k - s$ new vertices and all edges $V^+ \cup \{x_1, \cdots, x_s\}$, with $x \in V(F)$. The following theorem is a generalisation of Theorem 4.1 in two respects: (i) the relationship between a single edge and the Fano plane is replaced by the relationship between $F$ and $F^{+s}$, (ii) the assumptions have been relaxed to '$\theta$-approximate' assumptions.
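The blow-up and augmentation operations can be sketched in a few lines of Python. The representation (edges as tuples, blown-up vertices as pairs `(x, i)`, new vertices as pairs `("+", j)`) and the helper names are illustrative assumptions of ours. As a sanity check, the sketch verifies by brute force that the Fano plane embeds in the 2-augmentation of a single 3-edge, which is the relationship that the theorem below generalises.

```python
from itertools import product, permutations

def blowup(edges, s):
    """s-blowup F(s): vertex x becomes copies (x, 1), ..., (x, s) and each
    edge x^1...x^k becomes all s^k transversal edges."""
    return {frozenset(zip(e, idx))
            for e in map(tuple, edges)
            for idx in product(range(1, s + 1), repeat=len(e))}

def augmentation(edges, s):
    """s-augmentation F^{+s}: F(s) together with k - s new vertices V+ and
    the edges V+ ∪ {x_1, ..., x_s} for every vertex x of F."""
    edges = [tuple(e) for e in edges]
    k = len(edges[0])
    v_plus = frozenset(("+", j) for j in range(k - s))
    vertices = {x for e in edges for x in e}
    extra = {v_plus | frozenset((x, i) for i in range(1, s + 1))
             for x in vertices}
    return blowup(edges, s) | extra

def contains(host, guest):
    """Brute-force test for an injective edge-preserving map guest -> host."""
    hv = list({v for e in host for v in e})
    gv = list({v for e in guest for v in e})
    for image in permutations(hv, len(gv)):
        m = dict(zip(gv, image))
        if all(frozenset(m[v] for v in e) in host for e in guest):
            return True
    return False

fano = [(0, 1, 2), (0, 3, 4), (0, 5, 6), (1, 3, 5),
        (1, 4, 6), (2, 3, 6), (2, 4, 5)]
single_edge_aug = augmentation([(0, 1, 2)], 2)
print(len(single_edge_aug), contains(single_edge_aug, fano))
```

The brute-force containment test is only feasible for very small configurations; it is included to make the definitions checkable, not as a practical algorithm.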

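Theorem 5.5 Suppose $0 < 1/n \ll \theta \ll \varepsilon \ll 1/k$, $1 \le s \le k - 1$, $0 < \delta < 1$, $F$ is a $k$-graph, $H$ is a $k$-graph on a set $X$ of $n$ vertices, $d_{F^{+s}}(H) < \theta$ and the $s$-graph $G = \{S \in \binom{X}{s} : |N_H(S)| < \delta\binom{n}{k-s}\}$ has density $d(G) < \theta$. Then there is a subset $Z \subseteq X$ with $|Z| > (\delta - \varepsilon)n$ so that $d_F(H[Z]) < \varepsilon$.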

Proof of Theorem 5.5. We operate with a constant hierarchy $0 < 1/n \ll \theta \ll a \ll d_1 \ll \eta_2 \ll d_2 \ll \eta_3 \ll \cdots \ll d_{k-1} \ll \eta_k \ll \gamma, d_k \ll \omega \ll \varepsilon \ll 1$. Set $r = \lceil \omega^{-2} \rceil$ and let $X = X_1 \cup \cdots \cup X_r$ be an arbitrary partition with $||X_i| - n/r| < 1$ for $1 \le i \le r$. Let $H_1$ be the edges of $H$ that respect this partition, i.e. have at most one point in each $X_i$. Consider the partition system $P$ naturally associated with $H_1$, in which $P_A$ is the partition $K_A(X) = (H_1)_A \cup K_A(X) \setminus (H_1)_A$. Applying Theorem 5.1, using $\gamma$ instead of $\varepsilon$, we obtain a partition system $Q$ refining $P$ such that

• $Q_A = P_A$ when $|A| = k$,

• $|Q_A| \le d_{|A|}^{-1/2}$ when $|A| < k$,

• $Q_i = Q_{\{i\}}$, $1 \le i \le r$ are of the form $X_i^{0,1} \cup \cdots \cup X_i^{0,m_{i,0}} \cup X_i^1 \cup \cdots \cup X_i^{m_i}$, where $X_i^0 = X_i^{0,1} \cup \cdots \cup X_i^{0,m_{i,0}}$ are exceptional classes of size $|X_i^0| < \gamma|X_i|$, and the $X_i^t$, $1 \le i \le r$, $t \ne 0$ are all of equal size, say $m$, and

• if $x = (x_1, \cdots, x_r)$ is chosen uniformly at random from $K_{[r]}(X)$, then with probability at least $1 - \gamma$ the induced complex $Q(x)$ is $\eta$-quasirandom and $d$-dense, for some $0 < d_k < 1$.

Consider the set of all $s$-tuples contained in the vertex classes of $Q$: denote it $Y = \bigcup_{(i,t_i) \in I} Y_{i,t_i}$, where $I = \{(i,t_i) : 1 \le i \le r, 1 \le t_i \le m_i\}$ and $Y_{i,t_i} = \binom{X_i^{t_i}}{s}$. Let $W = \binom{X}{k-s}$ and $W' = \{R \in W : |N_H(R) \cap Y| > (\delta - \omega)|Y|\}$. Double counting gives

\[ (|Y| - \theta n^s/s!)\, \delta\binom{n}{k-s} \le \sum_{S \in Y \setminus G} d_H(S) = \sum_{R \in W} |N_H(R) \cap Y \setminus G| \le |W'||Y| + |W \setminus W'|(\delta - \omega)|Y|, \]

which gives $|W'| > \frac{\omega}{2}\binom{n}{k-s}$. Write $C_R = \{(i,t_i) \in I : |N_H(R) \cap Y_{i,t_i}| > \omega|Y_{i,t_i}|\}$. Since $a \ll d_1$ there are at most $2^{|I|} < a^{-1/2}$ possibilities for $C_R$, so there is some $C$ that occurs as $C_R$ for at least $a\binom{n}{k-s}$ of the $R$ in $W'$. For now we focus on one such $R$. Then, since $|X_i^{t_i}| = m$ for every $(i,t_i) \in I$,

\[ (\delta - \omega)|I|\binom{m}{s} < |N_H(R) \cap Y| \le \sum_{(i,t_i) \in C} \binom{m}{s} + \sum_{(i,t_i) \in I \setminus C} \omega\binom{m}{s}, \]

so $|C| > (\delta - 2\omega)|I|$. Let $Z = \bigcup_{(i,t_i) \in C} X_i^{t_i}$. Then $|Z| = |C|m > (\delta - 3\omega)n$.

Suppose $V(F) = \{v^1, \cdots, v^t\}$ for some $t$. We think of $F^{\le}$ as a $t$-partite $k$-complex with one vertex in each part. For any (ordered) $t$-tuple of vertices $x = (x_1, \cdots, x_t)$ with $x_a \in X_{i_a}^{t_a}$, $(i_a, t_a) \in C$, $i_a$ distinct, $1 \le a \le t$ we consider the induced $t$-partite $k$-complex $Q(x)$, i.e. the union of the induced complexes corresponding to each $k$-subset of $x$. For convenient notation we temporarily identify $(i_1, \cdots, i_t)$ with $[t] = \{1, \cdots, t\}$. We let $H(x) \subseteq Q(x)$ be obtained by keeping those $k$-tuples corresponding to edges of $H$, i.e. $H(x)_A = Q(x)_A$ if $Q(x)_A \subseteq H_A$ and $H(x)_A = \emptyset$ if $Q(x)_A \subseteq K_A \setminus H_A$, for $A \in \binom{[t]}{k}$. We claim that $H(x)$ is not both $\eta$-quasirandom and $(F^{\le}, d)$-dense. Otherwise, we can calculate as follows that there is too high a probability that a random map $\phi : V(F^{+s}) \to X$ is a homomorphism from $F^{+s}$ to $H$. Using the definition of $F^{+s}$, we write the event $\{\phi(F^{+s}) \subseteq H\}$ as $E_1 \cap E_2 \cap E_3 \cap E_4$, where $E_1$ is the event that the additional vertex set $V^+$ is mapped to some $R$ in $W'$ with $C_R = C$, $E_2$ is the event that $V(F(s))$ is mapped in $r$-partite fashion to the classes $X_{i_a}^{t_a}$, $1 \le a \le t$, $E_3$ is the event that each $s$-tuple $v^a_1 \cdots v^a_s$ is mapped to $N_H(R) \cap Y_{i_a,t_a}$ for $1 \le a \le t$, and $E_4$ is the event $\{\phi(F(s)) \subseteq H\}$. Now $\mathbb{P}(E_1) > a/2$ by definition of $C$, $\mathbb{P}(E_2 | E_1) = (m/n)^{ts} > a$ and $\mathbb{P}(E_3 | E_1 \cap E_2) > (\omega/2)^t$ by definition of $C_R$. Also, Theorem 5.4 and Corollary 5.3 applied to $J = F(s)^{\le}$ give

\[ \mathbb{P}(E_4 | E_1 \cap E_2 \cap E_3) = (1 \pm \varepsilon) \prod_{A \in F(s)^{\le}} d_A(H(x)) > \frac{1}{2} \prod_{A \in F^{\le}} d_{|A|}^{s^{|A|}} > a. \]

We deduce that $\mathbb{P}[\phi(F^{+s}) \subseteq H] > a/2 \cdot a \cdot (\omega/2)^t \cdot a > \theta$, which contradicts our assumptions. Therefore $H(x)$ is not both $\eta$-quasirandom and $(F^{\le}, d)$-dense.

Now we want to estimate $d_F(H[Z]) = \mathbb{P}_{f : V(F) \to Z}(f(F) \subseteq H[Z])$. The event $f(F) \subseteq H[Z]$ is contained in the event $B_1 \cup B_2 \cup B_3$ where $B_1$ is the event that $f$ is not a partite map (i.e. two vertices of $F$ are mapped to the same $X_i$), $B_2$ is the event that $f$ is a partite map but $F$ is mapped to $H(x)$ for some $t$-tuple $x$ that is not $\eta$-quasirandom, and $B_3$ is the event that neither $B_1$ nor $B_2$ holds and $f(F) \subseteq H$. Then $\mathbb{P}(B_1) < \binom{t}{2}\frac{1}{r} < \omega$. Also, $Q$ was chosen so that at most $\gamma n^k$ edges of $H_1$ belong to induced complexes which fail a quasirandomness condition, so $\mathbb{P}(B_2) < |E(F)|\gamma < \omega$. To estimate $\mathbb{P}(B_3)$ we note that on this event $f$ maps $V(F)$ to some $H(x)$ that is $\eta$-quasirandom, so by the previous paragraph it is not $(F^{\le}, d)$-dense. Since $|Q_A| \le d_{|A|}^{-1/2}$ when $|A| < k$, for any $a < k$, at most $\binom{k}{a}d_a^{1/2}n^k$ edges of $H_1$ belong to an induced complex in which some $a$-graph has relative density at most $d_a$. Also, at most $d_k n^k$ edges of $H_1$ belong to an induced complex in which the $k$-graph has relative density at most $d_k$. Summing these contributions, we deduce that $\mathbb{P}(B_3) < \omega$. Finally we have $d_F(H[Z]) \le \mathbb{P}(B_1) + \mathbb{P}(B_2) + \mathbb{P}(B_3) < 3\omega < \varepsilon$, as required. □

Remark. As a further extension, note that we could have taken any $I' \subseteq I$ and applied the argument with $Y' = \bigcup_{(i,t_i) \in I'} Y_{i,t_i}$ instead of $Y$. Thus we find some $C' \subseteq I'$ with $|C'| > (\delta - \varepsilon)|I'|$ such that $Z' = \bigcup_{(i,t_i) \in C'} X_i^{t_i}$ satisfies $|Z'| = |C'|m > \frac{|I'|}{|I|}(\delta - \varepsilon)n$ and $d_F(H[Z']) < \varepsilon$.
6 Codegree problems for projective planes

In this section we apply Theorem 5.5 to codegree problems for projective planes. The first subsection contains definitions and a summary of previous results on codegree problems for projective planes obtained in [15]. The second subsection generalises the approach used for the Fano plane to obtain strong structural information for the general problem, which already determines the codegree threshold for planes over a field of odd size up to an additive constant (the first part of Theorem 1.2). In the third subsection we complete the proof of Theorem 1.2 with a more detailed analysis for the plane over $\mathbb{F}_3$ in terms of its blocking sets: we see that the true codegree threshold in this case lies strictly between the natural upper and lower bounds found in the first subsection. The last subsection deals with the plane over $\mathbb{F}_4$, where we demonstrate a surprisingly different behaviour from $\mathbb{F}_2$ and fields of odd size.

6.1 Definitions and previous results 

First we give some definitions. Let $\mathbb{F}_q$ be the field with $q$ elements, for any prime power $q$. The projective geometry $PG_m(q)$ of dimension $m$ over $\mathbb{F}_q$ is a $(q+1)$-graph with vertex set equal to the one-dimensional subspaces of $\mathbb{F}_q^{m+1}$ and edges corresponding to the two-dimensional subspaces of $\mathbb{F}_q^{m+1}$, in that for each two-dimensional subspace, the set of one-dimensional subspaces that it contains is an edge of the hypergraph $PG_m(q)$. A vertex of $PG_m(q)$ can be described by projective co-ordinates as $(x_1 : \cdots : x_{m+1})$, where $(x_1, \cdots, x_{m+1})$ is any non-zero vector of $\mathbb{F}_q^{m+1}$ and $(x_1 : \cdots : x_{m+1})$ denotes the one-dimensional subspace that it generates.
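For readers who wish to experiment, here is a minimal Python sketch that builds the plane $PG_2(q)$ as a hypergraph directly from projective co-ordinates. It is restricted to $m = 2$ and to prime $q$, so that arithmetic modulo $q$ is field arithmetic (a prime power such as $q = 4$ would need genuine finite field arithmetic); the function names are our own, and `pow(x, -1, q)` requires Python 3.8 or later.

```python
from itertools import combinations, product

def projective_plane(q):
    """Points and lines of PG_2(q) for a PRIME q (arithmetic mod q)."""
    def normalise(v):
        # Scale so that the first non-zero co-ordinate equals 1.
        lead = next(c for c in v if c % q)
        inv = pow(lead, -1, q)
        return tuple((c * inv) % q for c in v)

    points = sorted({normalise(v)
                     for v in product(range(q), repeat=3) if any(v)})
    # Each two-dimensional subspace is the kernel of a non-zero linear form a,
    # and forms are again taken up to scaling, so lines are indexed by points.
    lines = [frozenset(p for p in points
                       if sum(ai * pi for ai, pi in zip(a, p)) % q == 0)
             for a in points]
    return points, lines

def every_pair_on_one_line(points, lines):
    """Check that any two points lie on exactly one common line."""
    return all(sum(1 for L in lines if x in L and y in L) == 1
               for x, y in combinations(points, 2))

points, lines = projective_plane(3)
print(len(points), len(lines), every_pair_on_one_line(points, lines))
```

For $q = 2$ this produces the Fano plane (7 points, 7 lines of size 3) and for $q = 3$ a $4$-graph with 13 vertices and 13 edges.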

The main result of [15] is the following upper bound on the codegree density for general projective 
geometries, which is tight in many cases. 

Theorem 6.1 The codegree density of projective geometries satisfies $\pi_q(PG_m(q)) \le 1 - 1/m$. Equality holds whenever $m = 2$ and $q$ is 2 or odd, and whenever $m = 3$ and $q$ is 2 or 3.

The results in [15] can be summarised by the following table, in which the entry in the cell indexed by row $m$ and column $q$ is either a number indicating the exact value of $\pi_q(PG_m(q))$ or an interval in which $\pi_q(PG_m(q))$ lies.



m \ q     2                3                4                2^t, t ≥ 3       p^t, p odd
2         1/2              1/2              [1/3, 1/2]       [0, 1/2]         1/2
3         2/3              2/3              [1/2, 2/3]       [0, 2/3]         [1/2, 2/3]
4         [2/3, 3/4]       [2/3, 3/4]       [1/2, 3/4]       [0, 3/4]         [1/2, 3/4]
m ≥ 5     [3/4, 1 - 1/m]   [2/3, 1 - 1/m]   [1/2, 1 - 1/m]   [0, 1 - 1/m]     [1/2, 1 - 1/m]



6.2 General structure 

A useful property of projective geometries proved in [15] is that $PG_m(q) \subseteq PG_{m-1}(q)^{+q}$; in particular $PG_2(q) \subseteq e^{+q}$, where $e$ is a single $(q+1)$-edge. Thus we may specialise Theorem 5.5 as follows.

Theorem 6.2 Suppose $q$ is a prime power, $0 < 1/n \ll \theta \ll \varepsilon \ll 1/q$, $0 < \delta < 1$, $H$ is a $(q+1)$-graph on a set $X$ of $n$ vertices, $d_{PG_2(q)}(H) < \theta$ and the $q$-graph $G = \{Q \in \binom{X}{q} : |N_H(Q)| < \delta n\}$ has density $d(G) < \theta$. Then there is a subset $Z \subseteq X$ with $|Z| > (\delta - \varepsilon)n$ so that $d(H[Z]) < \varepsilon$.

Similarly to Theorem 4.2 we can use this to deduce a stability result, describing the approximate structure of a $(q+1)$-graph that does not contain $PG_2(q)$ and has most of its $q$-degrees at least $n/2 - o(n)$ (see footnote 16).

Theorem 6.3 Suppose $q$ is a prime power, $0 < 1/n \ll \theta \ll \varepsilon \ll 1$, $H$ is a $(q+1)$-graph on a set $X$ of $n$ vertices, $d_{PG_2(q)}(H) < \theta$ and the $q$-graph $G = \{Q \in \binom{X}{q} : |N_H(Q)| < (1/2 - \theta)n\}$ has density $d(G) < \theta$. Then there is a partition $X = A \cup B$ such that at most $\varepsilon n^{q+1}$ edges are contained entirely within $A$ or within $B$.



Before giving the proof we remark that any $k$-graph $H$ with density $d(H) > 1 - \binom{m}{k}^{-1}$ contains $K_m^k$, the complete $k$-graph on $m$ vertices. Indeed, by averaging $H$ contains a set $M$ of $m$ vertices with $d(H[M]) \ge d(H)$, and then every $k$-tuple in $M$ must be an edge, since $H[M]$ misses fewer than one of the $\binom{m}{k}$ possible $k$-tuples. (In other words we are using the easy bound $\pi(K_m^k) \le 1 - \binom{m}{k}^{-1}$ on the Turan density, which is far from being best possible, but suffices for our purposes here.)

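Proof. Introduce a hierarchy of constants $\varepsilon = \theta_4 \gg \theta_3 \gg \theta_2 \gg \theta_1 = \theta$. By Theorem 6.2 we can find $A \subseteq X$ with $|A| > (1/2 - \theta_2)n$ and $d(H[A]) < \theta_2$, i.e. $|H[A]| < \theta_2|A|^{q+1}/(q+1)!$. Let $B = X \setminus A$. Now

\[ (q+1)|H[A]| + \sum_{Q \in \binom{A}{q}} |N_H(Q) \cap B| = \sum_{Q \in \binom{A}{q}} |N_H(Q)| > \left( \binom{|A|}{q} - |G| \right)(1/2 - \theta)n > (1/2 - \theta_2)n\binom{|A|}{q}, \]

so

\[ \sum_{b \in B} \left| N_H(b) \cap \binom{A}{q} \right| = \sum_{Q \in \binom{A}{q}} |N_H(Q) \cap B| > (1/2 - \theta_2)n\binom{|A|}{q} - \theta_2|A|^{q+1}/q! > (1/2 - \theta_3)n\binom{|A|}{q}. \]

This implies that $|B| > (1/2 - \theta_3)n$, so $|A|$ and $|B|$ are $(1/2 \pm \theta_3)n$. Let

\[ B_0 = \left\{ b \in B : \left| N_H(b) \cap \binom{A}{q} \right| < (1 - \theta_3^{1/2})\binom{|A|}{q} \right\}. \]

Then

\[ (1/2 - \theta_3)n\binom{|A|}{q} < \sum_{b \in B} \left| N_H(b) \cap \binom{A}{q} \right| \le |B_0|(1 - \theta_3^{1/2})\binom{|A|}{q} + |B \setminus B_0|\binom{|A|}{q}, \]

so

\[ |B_0| < \theta_3^{-1/2}(|B| - (1/2 - \theta_3)n) < 2\theta_3^{1/2}n. \]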



16 This weakened assumption may strike the reader as strange at first sight, as in a straightforward application of the stability method one has information about all $q$-degrees, but its importance will become clearer in the third subsection.
We claim that there is no edge $e \in H[B \setminus B_0]$. If there were we would have $\left| \bigcap_{b \in e} N_H(b) \cap \binom{A}{q} \right| > (1 - (q+1)\theta_3^{1/2})\binom{|A|}{q}$. But then, by the remark before the proof, $\bigcap_{b \in e} N_H(b) \cap \binom{A}{q}$ contains a $K^q_{q^2}$, and together with $e$ we have a copy of $PG_2(q)$. Therefore every edge of $B$ contains a point of $B_0$, so $|H[B]| < |B_0|\binom{|B|}{q} < 2\theta_3^{1/2}n((1/2 + \theta_3)n)^q/q! < \frac{1}{2}\varepsilon n^{q+1}$. Since $|H[A]| < \theta_2|A|^{q+1}/(q+1)!$ we are done. □

Finally, just as in the proof of Theorem 1.1 we use the previous stability result to obtain very precise information about a $(q+1)$-graph on a set $X$ of $n > n_0$ vertices with minimum $q$-degree $\delta_q(H) \ge n/2$ that does not contain $PG_2(q)$.

Theorem 6.4 For any prime power $q$ there is a number $n_0$ so that if $H$ is a $(q+1)$-graph on a set $X$ of $n > n_0$ vertices with minimum $q$-degree $\delta_q(H) \ge n/2$ that does not contain $PG_2(q)$ then $n$ is even, there is a partition $X = X_0 \cup X_1$ where $|X_0| = |X_1| = n/2$ and $H[X_0] = H[X_1] = \emptyset$.

Proof. Let $\varepsilon$ be sufficiently small and $n > n_0(\varepsilon, q)$ sufficiently large. By Theorem 6.3 we have a partition $X = X_0 \cup X_1$ so that $|H[X_0]| + |H[X_1]| < \varepsilon n^{q+1}$. Choose the partition that minimises $|H[X_0]| + |H[X_1]|$. We will show that this partition satisfies the conclusion of the theorem. Note first that the same argument used in the proof of Theorem 6.3 shows that $|X_0|$ and $|X_1|$ are $(1/2 \pm 2\varepsilon)n$.

First we show that there is no vertex $x \in X_0$ with degree at least $\varepsilon^{1/2q}n^q$ in $H[X_0]$. For suppose there is such a vertex $x$. By choice of partition we have $|N_H(x) \cap \binom{X_1}{q}| \ge |N_H(x) \cap \binom{X_0}{q}| \ge \varepsilon^{1/2q}n^q$, or we could reduce $|H[X_0]| + |H[X_1]|$ by moving $x$ to $X_1$. We can choose matchings $M = \{x_1^1 \cdots x_1^q, \cdots, x_m^1 \cdots x_m^q\}$ in $N_H(x) \cap \binom{X_0}{q}$ and $M' = \{y_1^1 \cdots y_1^q, \cdots, y_m^1 \cdots y_m^q\}$ in $N_H(x) \cap \binom{X_1}{q}$, with $m = \varepsilon^{1/2q}n/q$ (as in the case $q = 2$ we are using the well-known observation that any maximal matchings will be at least this large). Now

\[ \sum_{I = \{i_1, \cdots, i_q\} \in \binom{[m]}{q}} \; \sum_{J = (j_1, \cdots, j_q) \in [q]^q} |N_{H[X_0]}(x_{i_1}^{j_1} \cdots x_{i_q}^{j_q})| \le (q+1)|H[X_0]| < (q+1)\varepsilon n^{q+1}, \]

so we can choose $I \in \binom{[m]}{q}$ such that

\[ \sum_{J = (j_1, \cdots, j_q) \in [q]^q} |N_{H[X_0]}(x_{i_1}^{j_1} \cdots x_{i_q}^{j_q})| < \binom{m}{q}^{-1}(q+1)\varepsilon n^{q+1} < 2q^q(q+1)!\,\varepsilon^{1/2}n. \]

Also, for every $1 \le k \le m$, $H$ cannot have all $q^{q+1}$ edges $x_{i_1}^{j_1} \cdots x_{i_q}^{j_q} y_k^{j_{q+1}}$, with $j_1, \cdots, j_{q+1}$ in $[q]$, as then together with $x$ we have a $q$-augmented edge, which contains $PG_2(q)$. Therefore there is some $q$-tuple $Q = x_{i_1}^{j_1} \cdots x_{i_q}^{j_q}$ such that there are at least $q^{-q}m$ vertices $y_k^{j_{q+1}}$ that do not belong to $N_H(Q)$. This gives $|N_H(Q)| < 2q^q(q+1)!\,\varepsilon^{1/2}n + |X_1| - q^{-q-1}\varepsilon^{1/2q}n < (1/2 - \varepsilon)n$, which contradicts our assumptions. We deduce that there is no vertex in $X_0$ with degree at least $\varepsilon^{1/2q}n^q$ in $H[X_0]$. Similarly there is no vertex in $X_1$ with degree at least $\varepsilon^{1/2q}n^q$ in $H[X_1]$.

Write $|X_0| = n/2 + t$ and $|X_1| = n/2 - t$, where without loss of generality $0 \le t \le 2\varepsilon n$. Suppose for a contradiction that either $t > 0$ or $t = 0$ and there is an edge in $H[X_0]$ or $H[X_1]$. Note that for every $q$-tuple $Q \in \binom{X_0}{q}$ we have $|N_H(Q) \cap X_0| \ge |N_H(Q)| - |X_1| \ge t$. Thus we can assume there is at least one edge in $X_0$ (since the case $t = 0$ is symmetrical). Let $Y_i \subseteq X_i$ be minimum size transversals of $H[X_i]$. Then $Y_0 \ne \emptyset$. Also, by the previous paragraph

\[ |Y_0|\varepsilon^{1/2q}n^q \ge \sum_{x \in Y_0} |N_{H[X_0]}(x)| \ge |H[X_0]| = \frac{1}{q+1} \sum_{Q \in \binom{X_0}{q}} |N_H(Q) \cap X_0| \ge \frac{t}{q+1}\binom{n/2+t}{q}, \]

so $|Y_0| \ge \varepsilon^{-1/2q}\frac{t}{q+1}n^{-q}\binom{n/2+t}{q} > 2q^{3q}t$, say.

For each edge $e \in X_0$ consider all possible ways to extend it to a copy of $PG_2(q)$ using some $F \in \binom{X_1}{q^2}$. Since $H$ does not contain $PG_2(q)$ there is some $(q+1)$-tuple with $q$ points in $F$ and one point in $e$ that is not an edge. We count each such $(q+1)$-tuple at most $\binom{|X_1|-q}{q^2-q}$ times, so we get a set of at least $\binom{|X_1|}{q^2}\binom{|X_1|-q}{q^2-q}^{-1}$ distinct $(q+1)$-tuples. Thus there is some point $x$ in $e$ for which we have a set $M_x$ of at least $\frac{1}{q+1}\binom{|X_1|}{q^2}\binom{|X_1|-q}{q^2-q}^{-1} > q^{-3q}\binom{|X_1|}{q}$ 'missing' $(q+1)$-tuples involving $x$ and a $q$-tuple in $\binom{X_1}{q}$ not belonging to $N_H(x)$. Now varying $e$ over all edges in $X_0$ we get a set $M = \bigcup_x M_x$ of missing $(q+1)$-tuples with $|M| \ge |Y_0| \cdot q^{-3q}\binom{|X_1|}{q}$ (since $Y_0$ is a minimum size transversal). For each $q$-tuple $Q \in \binom{X_1}{q}$ we have

\[ n/2 \le |N_H(Q)| = |N_H(Q) \cap X_0| + |N_H(Q) \cap X_1| \le |X_0| - |N_M(Q)| + |N_H(Q) \cap X_1| \]

so

\[ (q+1)|H[X_1]| = \sum_{Q \in \binom{X_1}{q}} |N_H(Q) \cap X_1| \ge \sum_{Q \in \binom{X_1}{q}} (|N_M(Q)| - t) = |M| - t\binom{|X_1|}{q}. \]

Since $q^{-3q}|Y_0| - t > \frac{1}{2}q^{-3q}|Y_0|$ we have $|H[X_1]| > \frac{q^{-3q}}{2(q+1)}|Y_0|\binom{|X_1|}{q}$. Also $|Y_1|\varepsilon^{1/2q}n^q \ge \sum_{x \in Y_1} |N_{H[X_1]}(x)| \ge |H[X_1]|$, so $|Y_1| \ge \varepsilon^{-1/2q}\frac{q^{-3q}}{2(q+1)}|Y_0|\,n^{-q}\binom{n/2-t}{q} > 2|Y_0|$.

Finally we can apply the argument of the previous paragraph interchanging $X_0$ and $X_1$. We get a set $M'$ of at least $|Y_1| \cdot q^{-3q}\binom{|X_0|}{q}$ distinct $(q+1)$-tuples that are not edges, each having $q$ points in $X_0$ and 1 point in $X_1$. For each $q$-tuple $Q \subseteq X_0$ we now have $|N_H(Q) \cap X_0| \ge |N_{M'}(Q)| + t$, so $|H[X_0]| \ge \frac{1}{q+1}(q^{-3q}|Y_1| + t)\binom{|X_0|}{q}$ and $|Y_0| \ge \varepsilon^{-1/2q}n^{-q}|H[X_0]| > 2|Y_1|$. This contradiction completes the proof. □

Remark. The argument applies more generally to any $F$ with $PG_2(q) \subseteq F \subseteq e^{+q}$ (where $e = PG_1(q)$ is a single edge).

For general odd $q$ Theorem 6.4 determines the $q$-degree threshold to find $PG_2(q)$ up to a constant, thus proving the first part of Theorem 1.2. It provides an upper bound $\mathrm{ex}_q(n, PG_2(q)) \le n/2$. On the other hand, it was proved in [15] that there is no copy of $PG_2(q)$ in the complete oddly bipartite $(q+1)$-graph, by which we mean the construction obtained by forming a balanced partition $X = X_0 \cup X_1$ and taking as edges all $(q+1)$-tuples with an odd number of points in each $X_i$. This gives a lower bound $\mathrm{ex}_q(n, PG_2(q)) \ge \lfloor n/2 \rfloor - q + 1$.
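The minimum $q$-degree of the complete oddly bipartite construction can be checked numerically for small cases. The sketch below is only an illustration under our own conventions (vertices are $0, \cdots, n-1$, with $X_0$ the first $\lfloor n/2 \rfloor$ of them, and the function names are hypothetical); it does not test for copies of $PG_2(q)$, which is the part proved in [15].

```python
from collections import Counter
from itertools import combinations

def oddly_bipartite(n, q):
    """Complete oddly bipartite (q+1)-graph for odd q: X0 is the first
    floor(n/2) vertices, and the edges are the (q+1)-tuples with an odd
    number of points in X0 (hence also an odd number in X1, as q+1 is even)."""
    half = n // 2
    return [e for e in combinations(range(n), q + 1)
            if sum(v < half for v in e) % 2 == 1]

def min_codegree(n, edges, s):
    """Minimum, over all s-subsets S of the vertex set, of the number of
    edges containing S."""
    deg = Counter(S for e in edges for S in combinations(e, s))
    return min(deg[S] for S in combinations(range(n), s))

for n, q in [(8, 3), (10, 3), (12, 5)]:
    print(n, q, min_codegree(n, oddly_bipartite(n, q), q), n // 2 - q + 1)
```

The minimum is attained by $q$-tuples with $q - 1$ points in one part and one point in the other, which can only be extended inside the larger side of the tuple.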

6.3 The projective plane over $\mathbb{F}_3$

To nail down the constant uncertainty in the bound $\mathrm{ex}_q(n, PG_2(q)) \ge \lfloor n/2 \rfloor - q + 1$ for odd $q$ requires analysis of the degrees of $q$-tuples not contained in $X_0$ or $X_1$, which is closely connected to the theory of blocking sets in projective planes (see [30]). This theory is far from complete, but the case $q = 3$ is sufficiently simple to analyse.

Say that $S \subseteq PG_m(q)$ is a blocking set if $0 < |S \cap L| < |L|$ for every line $L$ of $PG_m(q)$. Note that the complement of a blocking set is also a blocking set, so the existence of a blocking set is equivalent to the existence of a bipartition of $PG_m(q)$. The blocking sets of $PG_2(3)$ may be classified as follows (see [30] or [10]): they all have size 6 or 7, and those of size 6 are of the form $L(x, y) \cup L(y, z) \cup L(x, z) \setminus \{x, y, z\}$, where $x, y, z$ are non-collinear points and we use the notation $L(a, b)$ for the line containing $a$ and $b$. (Those of size 7 are their complements.)
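Since $PG_2(3)$ has only 13 points, the classification just quoted can be confirmed by exhaustive search. The following Python sketch does this; the co-ordinate construction of the plane and all names are our own illustrative choices, and the check is a sanity test of the statement rather than a substitute for the proofs in [30] or [10].

```python
from itertools import combinations, product

q = 3

def normalise(v):
    # Scale a non-zero vector so that its first non-zero co-ordinate is 1.
    inv = pow(next(c for c in v if c % q), -1, q)
    return tuple((c * inv) % q for c in v)

points = sorted({normalise(v) for v in product(range(q), repeat=3) if any(v)})
lines = [frozenset(p for p in points
                   if sum(a * b for a, b in zip(form, p)) % q == 0)
         for form in points]

def is_blocking(S):
    """S meets every line but contains none."""
    return all(0 < len(S & L) < len(L) for L in lines)

blocking = [S for r in range(len(points) + 1)
            for S in map(frozenset, combinations(points, r))
            if is_blocking(S)]
sizes = {len(S) for S in blocking}

def line_through(a, b):
    return next(L for L in lines if a in L and b in L)

# Sets of the form L(x,y) ∪ L(y,z) ∪ L(x,z) \ {x,y,z} for non-collinear x,y,z.
triangles = set()
for x, y, z in combinations(points, 3):
    if z in line_through(x, y):
        continue
    sides = line_through(x, y) | line_through(y, z) | line_through(x, z)
    triangles.add(frozenset(sides - {x, y, z}))

six = {S for S in blocking if len(S) == 6}
print(sizes, six == triangles)
```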

Consider a partition $V(PG_2(3)) = A_0 \cup A_1$ of $PG_2(3)$ into blocking sets, where $A_1 = L(x,y) \cup L(y,z) \cup L(x,z) \setminus \{x, y, z\}$. We refer to the type of a line $L$ as $A_0^{t_0}A_1^{t_1}$ if $|L \cap A_i| = t_i$, $i = 0, 1$. Then the lines $L(x,y)$, $L(y,z)$ and $L(x,z)$ each have type $A_0^2A_1^2$, and their restrictions to each part form a triangle in $A_0$ and a matching in $A_1$. There are 6 lines of type $A_0^3A_1^1$, and we note that the 3 points of $A_0$ consist of 1 point from $\{x, y, z\}$ and 2 of the other 4 points. This leaves 4 lines of type $A_0^1A_1^3$, for which we note that the point in $A_0$ is not one of $\{x, y, z\}$. In the picture, we have $x = 100$, $y = 010$, $z = 001$, $A_0$ = white discs, $A_1$ = black discs.

Construction. Now we can describe an improvement to the complete oddly bipartite construction. 
Take a set $X$ of $n$ points, $n$ even, and partition it as $X = X_0 \cup X_1$ with $|X_0| = n/2 + 1$. Choose 2
special points $a, b$ in $X_0$. Form a 4-graph $H$ by taking as edges all 4-tuples which either have 3 points
in one $X_i$ and 1 point in $X_{1-i}$, or have 2 points in $X_1$ and 2 points in $X_0$, exactly one of which is $a$
or $b$. Then $H$ does not contain $PG_2(3)$, as the edges with 2 points in each part do not contain the
triangle with matching neighbourhood configuration described in the previous paragraph. Also, the
minimum 3-degree of $H$ is $n/2 - 1$.
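
The minimum 3-degree claim can be checked directly for small $n$. The sketch below is illustrative: the function names `construction` and `min_3_degree` are ours, vertices are labelled $0, \dots, n-1$ with $X_0 = \{0, \dots, n/2\}$, and the special points are taken to be 0 and 1. It does not test the absence of $PG_2(3)$.

```python
from itertools import combinations

def construction(n):
    """Edges of the 4-graph H on {0,...,n-1} with X_0 = {0,...,n/2}, a = 0, b = 1."""
    assert n % 2 == 0 and n >= 8
    X0 = set(range(n // 2 + 1))
    special = {0, 1}
    edges = set()
    for e in combinations(range(n), 4):
        in0 = [v for v in e if v in X0]
        k = len(in0)
        # k in (1, 3): three points in one part and one in the other;
        # k == 2: two points in each part, exactly one of a, b among those in X_0.
        if k in (1, 3) or (k == 2 and len(special & set(in0)) == 1):
            edges.add(frozenset(e))
    return edges

def min_3_degree(n, edges):
    """Minimum over all triples t of the number of vertices v with t + v an edge."""
    return min(
        sum(1 for v in range(n) if v not in t and frozenset(t + (v,)) in edges)
        for t in combinations(range(n), 3)
    )

for n in (8, 10, 12):
    print(n, min_3_degree(n, construction(n)))
```

The minimum is attained, for instance, by triples inside $X_0$, whose only neighbours are the $n/2 - 1$ points of $X_1$.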

We will show that this construction is optimal. First we need the following lemma, which is 
similar to (but does not follow from) a case of a result of Diwan and Sobhan Babu [5]. The proof is
a simple but slightly tedious case analysis which we outline here for the reader's convenience. 

Lemma 6.5 Suppose $G_1$, $G_2$ and $G_3$ are graphs on the same set $Y$ of at least 8 vertices, so that
each has minimum degree at least 2 and there is no 'multicoloured' matching $e_1, e_2, e_3$ with $e_i \in G_i$
for each $i$. Then there are two points $a, b$ that meet each edge of each $G_i$.

Proof. Suppose that no two points meet each edge of each $G_i$. We claim that there is a matching
$M$ of size 3 in $G = G_1 \cup G_2 \cup G_3$. To see this, consider a possible counterexample $G$. Divide into
cases according to the connectivity $\kappa(G)$ of $G$. The minimum degree condition implies that each
component of $G$ contains a matching of size 1 (i.e. an edge), and even a matching of size 2 unless
it is a triangle, so $G$ must be connected. If $\kappa(G) = 1$ and $v$ is a cutvertex then every component of
$G \setminus v$ contains an edge, so $G \setminus v$ has exactly two components $C$, $C'$. Since there is no matching of
size 3, at least one, say $C$, is a star with at least 4 vertices, i.e. its edges all contain some vertex $x$
in $C$. By the minimum degree condition any other vertex $y$ in $C$ is joined to $v$. But now we can
find a matching of size 3: take $yv$, $xz$ for some $z \in C \setminus \{x, y\}$ and an edge in $C'$. Next suppose that
$\kappa(G) = 2$ and $\{u, v\}$ is a cutset. By hypothesis $\{u, v\}$ does not meet every edge, so some component
$C$ of $G \setminus \{u, v\}$ contains an edge. If there are at least two other components $C'$, $C''$ of $G \setminus \{u, v\}$ then
we can extend this to a matching of size 3 with an edge from $C'$ to $u$ and an edge from $C''$ to $v$. If
there is one other component $C'$ of $G \setminus \{u, v\}$ then we can find a matching of size 3 in which there
is an edge from $C$ to $u$ or $v$, from $C'$ to the other of $u$ or $v$, and a third edge in $C$. Finally, suppose
$\kappa(G) \ge 3$ and $S$ is a minimum cutset. Since every $x \in S$ has a neighbour in every component of
$G \setminus S$ there are exactly 2 components of $G \setminus S$. Then we can find a matching of size 3 using two
edges from $S$ to components of $G \setminus S$ and one edge inside a component of $G \setminus S$. In all cases we see
that there is a matching $M$ of size 3 in $G = G_1 \cup G_2 \cup G_3$.

If not all three colours $G_i$ are used in $M$ we will show how to increase the number of colours.
Suppose first that all three edges are in $G_1$. Pick any new point $x$ and an edge $e$ of $G_2$ that contains
it. We can include $e$ and discard whichever edge of our original matching meets it (or any if none
does) to obtain a new matching $M'$ of size 3 on which the colour $G_2$ also appears (on edge $e$).
Now any new point $y$ must be incident to exactly two edges of $G_3$, joining it to the endpoints of $e$
(otherwise we would have a multicoloured matching). However, we could take another new point $z$,
an edge $e'$ in $G_2$ containing $z$, and include $e'$ in a new matching (discarding an appropriate edge).
Now we have either 2 edges of $G_1$ and 1 of $G_2$ (if $e$ was discarded) or 2 edges of $G_2$ and 1 of $G_1$
(otherwise). Either way, the same reasoning as before tells us that every new point $y'$ is incident in
$G_3$ exactly to the endpoints of $f$, where $f \ne e$ is some edge of the new matching, contradicting
the fact that it is incident exactly to the endpoints of $e$. □
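
To make the two alternatives in Lemma 6.5 concrete, the following Python sketch tests small examples by brute force. It is only an illustration under our own conventions: edges are `frozenset` pairs, and the function names `has_multicoloured_matching` and `common_cover` are hypothetical helpers, not notation from the paper.

```python
from itertools import combinations

def has_multicoloured_matching(G1, G2, G3):
    """True if there are pairwise disjoint edges e1 in G1, e2 in G2, e3 in G3."""
    for e1 in G1:
        for e2 in G2:
            if e1 & e2:
                continue
            for e3 in G3:
                if not (e1 & e3) and not (e2 & e3):
                    return True
    return False

def common_cover(graphs, vertices):
    """Return a pair (a, b) meeting every edge of every graph, or None."""
    for a, b in combinations(vertices, 2):
        if all(e & {a, b} for G in graphs for e in G):
            return (a, b)
    return None

Y = range(8)

# Example 1: every edge meets {0, 1}; each vertex has degree at least 2.
star = [frozenset({0, 1})] + [frozenset({c, v}) for c in (0, 1) for v in range(2, 8)]

# Example 2: the 8-cycle, taken as all three colour classes.
cycle = [frozenset({i, (i + 1) % 8}) for i in range(8)]

print(has_multicoloured_matching(star, star, star), common_cover([star] * 3, Y))
print(has_multicoloured_matching(cycle, cycle, cycle), common_cover([cycle] * 3, Y))
```

In the first example the pair $\{0, 1\}$ meets every edge and no multicoloured matching exists; in the second there is no such pair and a multicoloured matching is found, as the lemma requires.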

Theorem 6.6 If $n$ is sufficiently large and $H$ is a 4-graph on a set $X$ of $n$ vertices that does not
contain $PG_2(3)$ then $\delta_3(H) < n/2$.

Proof of Theorem 6.6. Suppose for a contradiction that $\delta_3(H) \ge n/2$. By Theorem 6.4 we have
$n$ even, $X = X_0 \cup X_1$ with $|X_0| = |X_1| = n/2$ and $H[X_0] = H[X_1] = \emptyset$. Also, for every 3-tuple
contained in one of the $X_i$, to get minimum 3-degree $n/2$ every 4-tuple obtained by adding a point
from $X_{1-i}$ must be an edge. For every pair $a, b$ in $X_0$ we have a graph $G_{a,b} = N_H(a,b) \cap \binom{X_1}{2}$ on
$X_1$ with minimum degree at least 2. These graphs do not contain a 'triangle-coloured matching', i.e.
a triple $a, b, c$ in $X_0$ and a matching $e_1, e_2, e_3$ in $X_1$ with $e_1 \in G_{b,c}$, $e_2 \in G_{a,c}$, $e_3 \in G_{a,b}$. Indeed, every
4-tuple with 3 points in one $X_i$ and one point in $X_{1-i}$ is an edge, so using the description of the
blocking sets in $PG_2(3)$, any triangle-coloured matching could be completed to a copy of $PG_2(3)$,
contrary to assumption. By the lemma there must be two points $a_1, b_1$ in $X_1$ that meet every edge
of each of $G_{b,c}$, $G_{a,c}$ and $G_{a,b}$. In fact $a_1, b_1$ must meet every edge of $G_{a',b'}$ for any pair $a', b'$ in $X_0$,
as may be seen by applying the previous reasoning in the triangles $a'b'a$ and $b'ab$ (without loss of
generality). Similarly there are two special points $a_0, b_0$ in $X_0$ that meet every edge with 2 points in
each $X_i$. But now any triple $a, b, c$ with say $a, b$ in $X_0 \setminus \{a_0, b_0\}$ and $c$ in $X_1 \setminus \{a_1, b_1\}$ is not contained
in any edge with 2 points in each $X_i$, so has 3-degree equal to $n/2 - 2$. This contradiction completes
the proof. □
Figure 3: $PG_2(4)$ and the second embedding used in the proof.

6.4 The projective plane over $F_4$

We conclude by demonstrating a somewhat surprising phenomenon for the projective plane over $F_4$:
its codegree density is less than $1/2 - c$ for some absolute $c > 0$, unlike the cases of $F_2$ and $F_q$, $q$ odd,
where the codegree density is 1/2.

Before proving our bound we need some information about the blocking sets of $PG_2(4)$. A
classification was given in [1], but we will just need two specific examples. For the first, suppose
more generally that $q$ is a prime power. Then one blocking set in $PG_2(q^2)$ is a Baer subplane $B$,
which may be constructed by restricting to those points $(x_0 : x_1 : x_2)$ that have some representative
$(a_0, a_1, a_2)$ with each $a_i$ in the base field $F_q$. Each line of $PG_2(q^2)$ contains either 1 or $q + 1$ points
of $B$. The intersections of size $q + 1$ in $B$ form the lines of a copy of $PG_2(q)$. Also, since every pair
of lines in $PG_2(q)$ intersect, the lines of $PG_2(q^2)$ containing them do not intersect outside of $B$. For
the sake of being more concrete, we remark that this can be described more explicitly in $PG_2(4)$
using the representation $PG_2(4) = \{A + x : x \in Z_{21}\}$, where $A = \{3, 6, 7, 12, 14\}$. An example
of a Baer subplane is $B = \{x : x \equiv 0 \bmod 3\}$. Dividing by 3 we can represent the lines of $B$ as
$\{A' + x : x \in Z_7\}$ where $A' = \{1, 2, 4\}$: a well-known description of the Fano plane.
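
The difference-set description is easy to verify by machine. The Python sketch below is an illustration of ours, not part of the paper, and all variable names are assumptions. It checks that $A$ is a perfect difference set modulo 21, that the multiples of 3 meet every line in 1 or 3 points, that the 3-point intersections give the Fano plane after dividing by 3, and that the lines extending the Fano lines are pairwise disjoint outside $B$.

```python
from itertools import combinations

A = (3, 6, 7, 12, 14)
lines = [frozenset((a + x) % 21 for a in A) for x in range(21)]

# Perfect difference set: each nonzero residue mod 21 is a difference exactly once.
diffs = sorted((a - b) % 21 for a in A for b in A if a != b)

# Consequently every pair of points lies on exactly one line.
two_points_one_line = all(
    sum(1 for L in lines if p in L and r in L) == 1
    for p, r in combinations(range(21), 2)
)

# The Baer subplane B: the multiples of 3.
B = frozenset(x for x in range(21) if x % 3 == 0)
intersection_sizes = {len(L & B) for L in lines}

# Lines meeting B in 3 points, with their points divided by 3.
baer_lines = {frozenset(p // 3 for p in L & B) for L in lines if len(L & B) == 3}
fano = {frozenset((a + x) % 7 for a in (1, 2, 4)) for x in range(7)}

# The two points outside B on each of these 7 lines.
extras = [L - B for L in lines if len(L & B) == 3]

print(diffs == list(range(1, 21)), intersection_sizes, baer_lines == fano)
```

The sets in `extras` are pairwise disjoint and cover the 14 points outside $B$, which is the independence property used later when embedding the 7 lines of type $X_0^3 X_1^2$.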

We also need to consider the blocking set $A = L(x, y) \cup L(x, z) \cup \{w\} \setminus \{y, z\}$, where $x, y, z$ are 3
non-collinear points, and $w \in L(y, z)$. Consider the associated 2-colouring $C_0 = A$, $C_1 = V(PG_2(4)) \setminus C_0$.
Recall that the type of an edge $L$ is $C_0^{|L \cap C_0|} C_1^{|L \cap C_1|}$. There are 2 edges of type $C_0^4 C_1$, 9 of type
$C_0 C_1^4$, 3 of type $C_0^3 C_1^2$ and 7 of type $C_0^2 C_1^3$. For the $C_0^3 C_1^2$ edges the triples in $C_0$ can be described
as $w d_i d'_i$ with $i = 1, 2, 3$, where $L(x,y) = \{x, y, d_1, d_2, d_3\}$ and $L(x,z) = \{x, z, d'_1, d'_2, d'_3\}$. For the
$C_0^2 C_1^3$ edges the triples in $C_1$ can be described as $e_1 e_2 e_3$, $a e_i e'_i$, $b e_i e''_i$ with $i = 1, 2, 3$, where we write
$L(y,z) = \{y, z, w, a, b\}$, $L(x,w) = \{x, w, e_1, e_2, e_3\}$, $L(a, e_i) \setminus (L(x,y) \cup L(x,z) \cup \{a, e_i\}) = \{e'_i\}$ and
$L(b, e_i) \setminus (L(x,y) \cup L(x,z) \cup \{b, e_i\}) = \{e''_i\}$. Figure 3 shows an incomplete representation of $PG_2(4)$,
with $C_0$ represented by black discs and $C_1$ by white discs.
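
The type counts for this 2-colouring can be confirmed in the cyclic model of $PG_2(4)$ given earlier. The following Python sketch is illustrative only: the names are ours, the non-collinear triple is simply the first one found, and $w$ is chosen as the smallest point of $L(y,z)$ other than $y$ and $z$.

```python
from collections import Counter
from itertools import combinations

DIFF = (3, 6, 7, 12, 14)  # perfect difference set mod 21
lines = [frozenset((a + s) % 21 for a in DIFF) for s in range(21)]

def line_through(p, r):
    """The unique line containing two distinct points."""
    return next(L for L in lines if p in L and r in L)

# First non-collinear triple x, y, z, and a point w on L(y, z) other than y, z.
x, y, z = next(t for t in combinations(range(21), 3)
               if not any(set(t) <= L for L in lines))
w = min(line_through(y, z) - {y, z})

# The blocking set A = C_0 and its complement C_1.
C0 = (line_through(x, y) | line_through(x, z) | {w}) - {y, z}
C1 = frozenset(range(21)) - C0

# Line types, recorded as (|L ∩ C_0|, |L ∩ C_1|).
types = Counter((len(L & C0), len(L & C1)) for L in lines)
print(len(C0), dict(types))
```

Every line meets both colour classes, so $C_0$ is indeed a blocking set of size 8.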

The following statement generalises Theorem 1.3 in that it makes an allowance for a small
exceptional set of small codegrees.
Theorem 6.7 For any $\epsilon > 0$ sufficiently small there is $\theta > 0$ and $n_0$ so that if $H$ is a 5-graph on
a set $X$ of $n > n_0$ vertices and $G = \{Q \in \binom{X}{4} : |N_H(Q)| < (1/2 - \theta)n\}$ satisfies $d(G) < \theta$ then
$d_{PG_2(4)}(H) > \epsilon$.

Proof. Let $\epsilon_5$ be a small constant (say $10^{-6}$), introduce a hierarchy of constants
$\epsilon_5 \gg \epsilon_4 \gg \epsilon_3 \gg \epsilon_2 \gg \epsilon_1 \gg \epsilon_0 \gg \epsilon \gg \theta$ and suppose $n$ is sufficiently large. Suppose for a contradiction that $d(G) < \theta$
but $d_{PG_2(4)}(H) < \epsilon$. By Theorem 6.3 we can find a partition $X = X_0 \cup X_1$ so that $|X_0|$ and $|X_1|$ are
$(1/2 \pm \epsilon_0)n$ and $|H[X_0]| + |H[X_1]| < \epsilon_0 n^5$.

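We introduce two 3-graphs for $i = 0, 1$:

\[ J_i = \left\{ e \in \binom{X_i}{3} : \left| N_H(e) \cap \binom{X_{1-i}}{2} \right| > \epsilon_1 \binom{|X_{1-i}|}{2} \right\}. \]
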

We claim that $d_{PG_2(2)}(J_i) < \epsilon_2$. For suppose otherwise, say that $d_{PG_2(2)}(J_0) \ge \epsilon_2$, and let $\phi$ be
a random map from $V(PG_2(4))$ to $X$, conditioned on the event $E(\phi)$ that a Baer subplane $B$ is
mapped to $J_0$, and the other points are mapped to $X_1$. We estimate the probability that $\phi$ embeds
$PG_2(4)$ in $H$. There are 2 types of edges to consider: 14 of type $X_0^1 X_1^4$ and 7 of type $X_0^3 X_1^2$. Suppose
$L$ is an edge of $PG_2(4)$ that we are attempting to embed with type $X_0^1 X_1^4$. For each point $a$ in $X_0$
let $m(a)$ be the number of 4-tuples $Q \in \binom{X_1}{4}$ such that $\{a\} \cup Q$ is not an edge of $H$. We have

\[ (1/2 - \theta)n \left( \binom{|X_1|}{4} - \theta \binom{n}{4} \right) \le \sum_{Q \in \binom{X_1}{4}} |N_H(Q)| = \sum_{a \in X_0} \left( \binom{|X_1|}{4} - m(a) \right) + 5|H[X_1]| \le |X_0| \binom{|X_1|}{4} - \sum_{a \in X_0} m(a) + 5\epsilon_0 n^5, \]

so $\sum_{a \in X_0} m(a) < 10\epsilon_0 n^5$. Now

\[ P(\phi(L) \notin H) \le \sum_{a \in X_0} P(\phi(L) \cap X_0 = \{a\}) \, m(a) |X_1|^{-4}, \]

and since there are at most $\binom{|X_0| - 1}{6}$ copies of $PG_2(2)$ in $J_0$ that use $a$ we have
$P(\phi(L) \cap X_0 = \{a\}) < \binom{|X_0| - 1}{6} / \left( \epsilon_2 \binom{|X_0|}{7} \right) = \frac{7}{\epsilon_2 |X_0|}$, so
$P(\phi(L) \notin H) < \frac{7}{\epsilon_2 |X_0|} \cdot 10\epsilon_0 n^5 |X_1|^{-4} < 5000 \epsilon_2^{-1} \epsilon_0$. We are attempting
to embed 14 edges of type $X_0^1 X_1^4$, so the probability that any fails is less than $10^5 \epsilon_2^{-1} \epsilon_0$. On the other
hand, we have conditioned on the event that the Baer subplane $B$ is mapped to $J_0$, and so by definition
each of the 7 attempted embeddings of type $X_0^3 X_1^2$ is successful with probability at least $\epsilon_1 - O(1/n)$.
Furthermore, if $T_1, \dots, T_7$ are the triples of the Baer subplane then $N_{PG_2(4)}(T_1), \dots, N_{PG_2(4)}(T_7)$
are disjoint sets, so the events are independent. With probability $1 - O(1/n)$ their images under $\phi$
are disjoint, so we estimate $P(\phi(PG_2(4)) \subseteq H) > \epsilon_1^7 - O(1/n) - 10^5 \epsilon_2^{-1} \epsilon_0 > \epsilon_1^7 / 2$, say. Then, letting $\psi$
be a random map from $V(PG_2(4))$ to $X$, we have $d_{PG_2(4)}(H) = P(\psi(PG_2(4)) \subseteq H) \ge P[E(\psi)] \epsilon_1^7 / 2 >
\epsilon_2 (1/4)^{14} \epsilon_1^7 / 2 > \epsilon$. This contradiction shows that $d_{PG_2(2)}(J_i) < \epsilon_2$.

Next we introduce 2-graphs for $i = 0, 1$:

\[ P_i = \left\{ \{a_i, a'_i\} \in \binom{X_i}{2} \text{ with } |N_{J_i}(a_i, a'_i)| < (1/2 - \epsilon_2)|X_i| \right\}. \]

We claim that $|P_i| < \epsilon_2 \binom{|X_i|}{2}$ for at least one of $i = 0, 1$. For suppose otherwise, and consider a
4-tuple $Q = \{a_0, a'_0, a_1, a'_1\}$, where $\{a_i, a'_i\}$ are chosen uniformly at random from $P_i$, $i = 0, 1$. We
have $E|N_H(Q)| = E|N_H(Q) \cap X_0| + E|N_H(Q) \cap X_1|$. Now

\[ E|N_H(Q) \cap X_0| = E_{a_0, a'_0} \sum_{a \in X_0} P_{a_1, a'_1}(a \in N_H(Q) \mid a_0, a'_0) \]

and $P_{a_1, a'_1}(a \in N_H(Q) \mid a_0, a'_0) = P(\{a_1, a'_1\} \in N_H(a_0, a'_0, a))$. For $a \in N_{J_0}(a_0, a'_0)$ we estimate
this probability trivially by 1, but for $a \in X_0 \setminus N_{J_0}(a_0, a'_0)$ we use the definition of $J_0$ and the
lower bound on $P_1$ to estimate $P(\{a_1, a'_1\} \in N_H(a_0, a'_0, a)) < \epsilon_1 \epsilon_2^{-1}$. This gives $E|N_H(Q) \cap X_0| <
|N_{J_0}(a_0, a'_0)| + \epsilon_1 \epsilon_2^{-1} |X_0| < (1/2 - \epsilon_2/2)|X_0|$. Similarly we estimate $E|N_H(Q) \cap X_1| < (1/2 - \epsilon_2/2)|X_1|$,
so $E|N_H(Q)| < (1/2 - \epsilon_2/2)n$. However, we also have $E|N_H(Q)| \ge P(Q \notin G) \cdot (1/2 - \theta)n$, and

\[ P(Q \in G) \le \frac{|G|}{|P_0||P_1|} \le \frac{\theta n^4 / 24}{\epsilon_2^2 \binom{|X_0|}{2} \binom{|X_1|}{2}} < \epsilon, \]

so $E|N_H(Q)| > (1 - \epsilon)(1/2 - \theta)n$. This contradiction shows that at least one $P_i$ is small, say
$|P_0| < \epsilon_2 \binom{|X_0|}{2}$.


Now we can apply Theorem 6.3 to find a partition $X_0 = A_0 \cup B_0$ where $|A_0|$ and $|B_0|$ are
$(1/2 \pm \epsilon_3)|X_0|$ such that at most $\epsilon_3 |X_0|^3$ edges of $J_0$ are contained entirely within $A_0$ or within $B_0$.
Next we repeat the argument to deduce similar structural information on $J_1$. Let

\[ P'_0 = \left\{ \{a_0, a'_0\} \in \binom{A_0}{2} \text{ with } |N_{J_0}(a_0, a'_0) \cap A_0| < 400\epsilon_3 |X_0| \right\}, \]

\[ P'_1 = \left\{ \{a_1, a'_1\} \in \binom{X_1}{2} \text{ with } |N_{J_1}(a_1, a'_1)| < (1/2 - \epsilon_4)|X_1| \right\}. \]

We must have $|P'_0| > \frac{1}{2} \binom{|A_0|}{2}$; otherwise

\[ \epsilon_3 |X_0|^3 \ge |J_0[A_0]| = \frac{1}{3} \sum_{a_0, a'_0 \in A_0} |N_{J_0}(a_0, a'_0) \cap A_0| \ge \frac{1}{3} \cdot \frac{1}{2} \binom{|A_0|}{2} \cdot 400\epsilon_3 |X_0|, \]

which is a contradiction. Now we cannot have $|P'_1| > \epsilon_4 \binom{|X_1|}{2}$, as then considering $Q = \{a_0, a'_0, a_1, a'_1\}$,
where $\{a_i, a'_i\}$ are chosen uniformly at random from $P'_i$, $i = 0, 1$, we estimate (similarly to before)
$E|N_H(Q) \cap X_0| < (1/2 + 500\epsilon_3)|X_0| + \epsilon_1 \epsilon_4^{-1} |X_0|$ and $E|N_H(Q) \cap X_1| < (1/2 - \epsilon_4)|X_1| + 10\epsilon_1 |X_1|$, so
$E|N_H(Q)| < (1/2 - \epsilon_4/2)n$, contradiction. Again, by Theorem 6.3 we find a partition $X_1 = A_1 \cup B_1$
where $|A_1|$ and $|B_1|$ are $(1/2 \pm \epsilon_5)|X_1|$ such that at most $\epsilon_5 |X_1|^3$ edges of $J_1$ are contained entirely
within $A_1$ or within $B_1$.

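Let $T_i$ count edges of $H$ of type $A_0^2 A_1^2 B_i$, $i = 0, 1$. We can bound $T_i$ by summing degrees of
quadruples $Q = \{a_0, a'_0, a_1, a'_1\}$ with $\{a_i, a'_i\} \in \binom{A_i}{2}$:

\[ (1/2 - \theta)n \left( \binom{|A_0|}{2} \binom{|A_1|}{2} - \theta n^4 / 24 \right) \le T_0 + T_1 + 3 \sum_{e \in \binom{A_0}{3}} \left| N_H(e) \cap \binom{A_1}{2} \right| + 3 \sum_{e \in \binom{A_1}{3}} \left| N_H(e) \cap \binom{A_0}{2} \right|. \]

Now

\[ \sum_{e \in \binom{A_0}{3}} \left| N_H(e) \cap \binom{A_1}{2} \right| \le |J_0[A_0]| \binom{|A_1|}{2} + \epsilon_1 \binom{|A_0|}{3} \binom{|X_1|}{2} \]

and

\[ \sum_{e \in \binom{A_1}{3}} \left| N_H(e) \cap \binom{A_0}{2} \right| \le |J_1[A_1]| \binom{|A_0|}{2} + \epsilon_1 \binom{|A_1|}{3} \binom{|X_0|}{2}, \]

so $T_0 + T_1 > (1/2 - 40\epsilon_5) n \binom{|A_0|}{2} \binom{|A_1|}{2}$. Therefore

\[ T_i \ge T_0 + T_1 - |B_{1-i}| \binom{|A_0|}{2} \binom{|A_1|}{2} > (1 - 400\epsilon_5) |B_i| \binom{|A_0|}{2} \binom{|A_1|}{2}, \quad i = 0, 1. \]
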

By symmetry, similar bounds hold for the number of edges in each case when we specify a triple in
one $X_i$ respecting the partition $(A_i, B_i)$ and a pair in $A_{1-i}$ or $B_{1-i}$, i.e. the types $A_0^2 B_0 A_1^2$, $A_0^2 B_0 B_1^2$,
$A_0 B_0^2 A_1^2$, $A_0 B_0^2 B_1^2$, $A_0^2 A_1^2 B_1$, $A_0^2 A_1 B_1^2$, $B_0^2 A_1^2 B_1$, $B_0^2 A_1 B_1^2$.

Now we find $PG_2(4)$ using the $A$ colouring. Let $\phi$ be a random map from $V(PG_2(4))$ to $X$,
conditioned on the event $E'(\phi)$ that $w, x$ are in $A_0$, $y, z, e_1, e_2, e'_i, e''_i$ are in $A_1$, $d_i, d'_i$ are in $B_0$ and
$e_3, a, b$ are in $B_1$ ($i$ ranges from 1 to 3). Note that $M = \min\{|A_0|, |B_0|, |A_1|, |B_1|\} > (1/4 - 2\epsilon_5)n$.
There are 11 attempted embeddings of type $X_i^4 X_{1-i}$. Recalling that $m(a)$ is the number of 4-tuples
$Q \in \binom{X_1}{4}$ such that $\{a\} \cup Q$ is not an edge of $H$, we estimate that the probability that we fail to
embed some such $L$ is at most $11 \sum_{a \in X_{1-i}} m(a) M^{-5} < 10^6 \epsilon_0$. There are 10 attempted embeddings
that have the type discussed in the previous paragraph, i.e. one of the types equivalent to $A_0^2 B_0 A_1^2$.
Each fails with probability at most $400\epsilon_5$, so the probability that any fails is at most $4000\epsilon_5$. Now a
random map $\psi$ from $V(PG_2(4))$ to $X$ satisfies $E'(\psi)$ with probability at least $(1/5)^{21}$ and so succeeds
in embedding $PG_2(4)$ in $H$ with probability at least $(1/5)^{21}/2 > \epsilon$. This contradiction completes
the proof. □
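
The bookkeeping behind the event $E'(\phi)$ can be checked in the cyclic model $PG_2(4) = \{A + x : x \in Z_{21}\}$. The Python sketch below is our own illustration; the variable names, the choice $x, y, z = 0, 1, 2$ and the arbitrary ordering of the points $w, a, b$, $e_i$, $d_i$, $d'_i$ are assumptions. It labels the 21 points by the classes $A_0, B_0, A_1, B_1$ as in the proof and confirms that 11 lines have 4 points on one side, while each of the other 10 has a triple on one side meeting both classes of that side and a pair on the other side inside a single class.

```python
DIFF = (3, 6, 7, 12, 14)  # perfect difference set mod 21
lines = [frozenset((p + s) % 21 for p in DIFF) for s in range(21)]

def line_through(p, r):
    """The unique line containing two distinct points."""
    return next(L for L in lines if p in L and r in L)

x, y, z = 0, 1, 2
Lxy, Lxz, Lyz = line_through(x, y), line_through(x, z), line_through(y, z)
assert z not in Lxy  # x, y, z are non-collinear in this model

w, a, b = sorted(Lyz - {y, z})
e = sorted(line_through(x, w) - {x, w})   # e_1, e_2, e_3
d = sorted(Lxy - {x, y})                  # d_1, d_2, d_3
dp = sorted(Lxz - {x, z})                 # d'_1, d'_2, d'_3

def remaining_point(c, ei):
    """The single point of L(c, e_i) outside L(x,y), L(x,z) and {c, e_i}."""
    (pt,) = line_through(c, ei) - Lxy - Lxz - {c, ei}
    return pt

e1p = [remaining_point(a, ei) for ei in e]   # e'_i
e2p = [remaining_point(b, ei) for ei in e]   # e''_i

# Class of each point under the event E'.
part = {}
for v in (w, x):
    part[v] = "A0"
for v in d + dp:
    part[v] = "B0"
for v in [y, z, e[0], e[1]] + e1p + e2p:
    part[v] = "A1"
for v in [e[2], a, b]:
    part[v] = "B1"

pure, mixed_ok = 0, 0
for L in lines:
    side0 = [part[v] for v in L if part[v].endswith("0")]
    side1 = [part[v] for v in L if part[v].endswith("1")]
    if len(side0) in (1, 4):
        pure += 1          # type X_i^4 X_{1-i}
    else:
        triple, pair = (side0, side1) if len(side0) == 3 else (side1, side0)
        if len(set(triple)) == 2 and len(set(pair)) == 1:
            mixed_ok += 1  # triple respects (A_i, B_i), pair inside one class
print(pure, mixed_ok)
```

This matches the counts of 11 and 10 attempted embeddings used in the proof.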

Remark. The question of what happens for $PG_2(2^s)$ in general is intriguing. It seems plausible that
the above approach of going from $PG_2(2)$ to $PG_2(4)$ could be adapted to an inductive argument
when $s = 2^r$ is a power of 2. Much of the argument would go through as above: our hypergraph
has an approximate bipartition $X = X_0 \cup X_1$ and $J_i = \{e \in \binom{X_i}{2^{2^{r-1}}+1} : |N_H(e)| > \epsilon_1 \binom{|X_{1-i}|}{2^{2^r} - 2^{2^{r-1}}}\}$
satisfy $d_{PG_2(2^{2^{r-1}})}(J_i) < \epsilon_2$. The step that may fail is finding $Q$ of low degree: our approach used
the convenient coincidence of $2 \cdot 2 = 2^2$, which only occurs for $r = 1$.



7 Future directions 

The basic form of our hypergraph regularity method has been well illustrated by its application to 
projective planes, which are relatively easy to deal with (for reasons yet to be understood), although 
even here we cannot give exact answers in all cases, and fields of even size seem particularly strange. 
However, the quasirandom counting lemma has potential to be a powerful tool in the study of 
any Turan problem, whether generalised or standard. For example, in the Turan problem for the 
tetrahedron, if we consider any $K_4^3$-free 3-graph $H$ and a vertex $x$ then the edges of $H$ cannot be
quasirandomly distributed with positive density within the triangles of the neighbourhood graph 
N H (x). 
If we restrict attention only to excluding simple $k$-graphs $F$ (meaning that each pair of edges in
$F$ have at most one common point) then the projective geometries in higher dimensions point to
one stumbling block that should be overcome in future developments of this method. For example,
if we consider a 3-graph $H$ on $n$ vertices with no $PG_3(2)$ and all but $o(n^2)$ codegrees at least
$(2/3 - o(1))n$ then our results will give a set $Z$ of $(2/3 - o(1))n$ vertices that induce a 3-graph with
no Fano plane and all but $o(n^2)$ codegrees at least $(1/2 - o(1))|Z|$, so $Z$ is approximately bipartite
by our structure result. This suggests that $PG_3(2)$ should be approximately tripartite, and hence
an inductive approach for general $m$ and $q$ showing that a $(q+1)$-graph $H$ with no $PG_m(q)$ and all
but $o(n^q)$ codegrees at least $(1 - 1/m - o(1))n$ should be approximately $m$-partite.

A potential approach to filling in the gap is suggested by the remark after Theorem 5.5. If we
stick to $PG_3(2)$ for the sake of simplicity, then not only do we have an approximately bipartite
subhypergraph of size about $(2/3)n$, but any set of vertices $V'$ obtained by taking some classes of the
regularity partition contains some approximately bipartite $Z'$ of size about $(2/3)|V'|$. Thus we are
faced with the problem of recovering structural information about $H$ from various restrictions, which
is perhaps best understood in the context of property testing (see [24] for a hypergraph property
testing result and many references to the literature). Although a full investigation of this idea 
is beyond the scope we have set for this paper introducing our basic method, we remark that it 
should be possible to carry our arguments over to this context via a random reducibility property 
of quasirandom complexes, i.e. that a random restriction of a quasirandom complex to sets of large 
constant size should be quasirandom with high probability: a high-level sketch is that the martingale 
used by Lovasz and Szegedy [18] to show concentration of the probability that a random map from
a fixed graph F to a random graph G is a homomorphism may be extended to show concentration 
of the octahedral counting function that appears in the definition of quasirandomness. This will 
allow us to conclude that if m is a large constant and M is a random m-set of vertices then M 
contains an approximately bipartite subhypergraph of size about (2/3)m with high probability (say 
$1 - \exp(-m^c)$ for some $c > 0$). However, even assuming this it is still not clear how to recover
the global approximate structure of H. If exact results are desired we also have the problem of 
recovering the exact structure from the approximate structure. This seems to be quite a different 
type of question, and so far all instances of its solution have been of a rather ad hoc character, so it 
would be interesting to develop some general principles here as well. 

Acknowledgement. The author thanks Oleg Pikhurko and Yi Zhao for discussions about Mubayi's 
conjecture and the anonymous referees for helpful comments. 
A Variant forms of the Gowers quasirandomness framework

In this appendix we justify the variant forms of the decomposition theorem and counting lemma that
we used in the paper. We first note that the proof given for Theorem 5.1 in [8] also proves Theorem
5.2 in this paper: it only helps to assume stronger inequalities for the $\eta_i$, and our parameter hierarchy
is such that we can replace $|J_0 \setminus J_1| \epsilon$ in [8] by $\epsilon$ for simpler notation. Then the arguments in section
6 of [8] go through as written. The proof of Theorem 5.1 is quite similar to the proof of Theorem
7.3 in [8]. We will not reproduce that proof in full, but will outline it to sufficient extent to explain
what modifications are needed. First we recall a definition ([8] p. 36):
If $S = \{S_1, \dots, S_s\}$ and $T = \{T_1, \dots, T_t\}$ are partitions of the same set $U$, the mean-square
density of $S$ with respect to $T$ is

\[ \mathrm{msd}_T(S) = \sum_{i=1}^{s} \sum_{j=1}^{t} \frac{|T_j|}{|U|} \left( \frac{|S_i \cap T_j|}{|T_j|} \right)^2. \]

We also recall Lemma 8.1 of [8], which states that if $T'$ is a refinement of $T$ then $\mathrm{msd}_{T'}(S) \ge \mathrm{msd}_T(S)$.
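
As a small numerical illustration of the monotonicity in Lemma 8.1 of [8], the Python sketch below computes the mean-square density with exact rational arithmetic. It assumes the standard formula, namely the sum over $i, j$ of $(|T_j|/|U|)(|S_i \cap T_j|/|T_j|)^2$, and that all classes of $T$ are non-empty; the function name `msd` and the example partitions are ours.

```python
from fractions import Fraction

def msd(S, T):
    """Mean-square density of the partition S with respect to the partition T.

    S and T are lists of disjoint sets with the same union U; every class of T
    is assumed to be non-empty.
    """
    U = sum(len(t) for t in T)
    return sum(
        Fraction(len(t), U) * Fraction(len(s & t), len(t)) ** 2
        for s in S
        for t in T
    )

# A partition S of {0,...,11}, a partition T, and a refinement T_fine of T.
S = [set(range(0, 6)), set(range(6, 12))]
T = [set(range(0, 8)), set(range(8, 12))]
T_fine = [set(range(0, 3)), set(range(3, 8)), set(range(8, 12))]

print(msd(S, T), msd(S, T_fine))
```

Here refining $T$ raises the mean-square density from 3/4 to 4/5, in line with the lemma.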

Proof of Theorem 5.1. The proof of Theorem 7.3 in [8] is by means of the following iterative
procedure. Suppose we have a partition $k$-system $P$ (which may be the initial partition or one
produced by some number of iterations: we will not complicate notation with a sequence $P_1, P_2, \dots$).
Consider a partition $k$-system $P^*$ defined by what is called weak equivalence in [8]: $S, S' \in K_A(X)$ are
in the same class of $P^*_A$ exactly when $S_B$ and $S'_B$ are in the same class of $P_B$ for every proper subset
$B \subset A$. It is shown (see [8] pp. 38-39) that if $P_x(P(x) \text{ is } (\epsilon, j, k)\text{-quasirandom}) < 1 - \epsilon$, then there is
$A \in \binom{[r]}{\le k}$ and a refining partition $k$-system $Q$ of $P$, so that (a) $Q_B = P_B$ unless $B \subset A$, $|B| = |A| - 1$,
(b) if $B \subset A$, $|B| = |A| - 1$ then $Q_B$ is a refinement of $P_B$ where each class of $P_B$ is partitioned into
at most $c_A(\epsilon, P)$ further classes, and (c) $\mathrm{msd}_{Q^*_A}(Q_A) \ge \mathrm{msd}_{P^*_A}(P_A) + f_A(\epsilon, P)$. Here $c_A$ and $f_A$ are
explicitly defined functions that depend only on $\epsilon$ and $P$, and furthermore the dependence of $c_A$ on
$P$ depends only on $\{|P_B| : B \subset A\}$ and that of $f_A$ depends only on $\{|P_B| : |B| \ge |A|\}$. This property
of $f_A$ implies that $f_A(\epsilon, Q) = f_A(\epsilon, P)$. Furthermore, this argument still applies in the context of
our proof, i.e. we have the same conclusion if $P(x)$ is $\eta$-quasirandom and $d$-dense with probability
less than $1 - \epsilon$. (Note that the functions $c_A$ and $f_A$ now depend on the functions implicit in the
$\ll$-notation for the parameter hierarchy.)

To see that the procedure terminates (with some system of partitions with the required
property) introduce a function $\zeta_P$ for the system of partitions $P$, which is defined on $\binom{[r]}{\le k}$ by

\[ \zeta_P(A) = \frac{1 - \mathrm{msd}_{P^*_A}(P_A)}{f_A(\epsilon, P)}. \]

Choose an ordering $<$ of $\binom{[r]}{\le k}$ in which $|B| > |B'|$ implies that $B < B'$. Order
functions $\zeta$ on $\binom{[r]}{\le k}$ by $\zeta < \zeta'$ if there is $B \in \binom{[r]}{\le k}$ such that $\zeta(B') = \zeta'(B')$ for all $B' < B$ and
$\zeta(B) < \zeta'(B)$. This is a well-ordering, and the iteration takes the system $P$ to a system $Q$ with
$\zeta_Q < \zeta_P$, so the procedure terminates.

To prove our version we introduce further refinements in each step of the procedure. First of all
we make the general observation that given any partition $E = E_1 \cup \dots \cup E_t$ there is an 'equalising
method' to find a partition $E = E'_0 \cup E'_1 \cup \dots \cup E'_s$, for which $|E'_i| = \lfloor |E|/t^2 \rfloor$ for every $i \ne 0$,
$|E'_0| < |E|/t$ and every $E'_i$ with $i \ne 0$ is contained in some $E_j$. The method is to repeatedly and
arbitrarily select classes $E'_i$ within some $E_j$ that still has size at least $\lfloor |E|/t^2 \rfloor$ and then remove
its elements from consideration in later stages. Thus we are unable to use at most $|E|/t^2$ elements
from each of the $t$ original classes $E_j$, and we put the unused elements together in an exceptional
class $E'_0 = E'_{0,1} \cup \dots \cup E'_{0,t}$ of size at most $|E|/t$.

We start by using the equalising method in an initial refinement to transform $P$ into some
$Q$ with an equitable partition of the vertex set. By first arbitrarily refining $P$ we can assume that
$|X_i^0| < \epsilon |X_i| / 2$, $1 \le i \le r$. Then we repeatedly apply the same refinement procedure as above followed
by the equalising method: at each stage we obtain a new system $Q$ from the original procedure and
then refine it to some equitable $Q'$. It is clear that the number of classes remains bounded by
a function only of $m$, $r$, $k$ and $\epsilon$. By Lemma 8.1 of [8] we have $\mathrm{msd}_{Q'^*_A}(Q'_A) \ge \mathrm{msd}_{Q^*_A}(Q_A) \ge
\mathrm{msd}_{P^*_A}(P_A) + f_A(\epsilon, P)$. Also, we still have $Q_B = P_B$ for all $B$ with $|B| \ge |A|$, so $f_A(\epsilon, Q) = f_A(\epsilon, P)$
and the iterations terminate as before. The amounts added to the exceptional classes decrease rapidly
with each iteration (certainly each is at most half of that at the previous iteration), and as we initially
added at most $\frac{1}{2} \epsilon |X_i|$ exceptional elements of $X_i$ we end up with at most $\epsilon |X_i|$. □
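
The 'equalising method' described in the proof above is a simple greedy procedure, and a minimal Python sketch may help fix ideas. It is our own illustration, not code from the paper: the function name `equalise` is hypothetical, classes are given as lists, the "arbitrary" selection is taken to be consecutive chunks, and we assume $|E| \ge t^2$ so that the common class size is positive.

```python
def equalise(classes):
    """Split a partition E = E_1 ∪ ... ∪ E_t into equal-sized parts plus leftovers.

    Returns (exceptional, parts): every part has size floor(|E| / t^2) and lies
    inside a single original class; `exceptional` collects the unused elements.
    """
    t = len(classes)
    total = sum(len(c) for c in classes)
    size = total // (t * t)
    if size == 0:
        raise ValueError("need |E| >= t^2 so that the part size is positive")
    parts, exceptional = [], []
    for c in classes:
        c = list(c)
        full = len(c) // size          # number of complete parts inside this class
        for i in range(full):
            parts.append(c[i * size:(i + 1) * size])
        exceptional.extend(c[full * size:])   # fewer than `size` elements remain
    return exceptional, parts

# Example with t = 3 classes of sizes 10, 7 and 13, so |E| = 30 and parts have size 3.
classes = [list(range(0, 10)), list(range(10, 17)), list(range(17, 30))]
exceptional, parts = equalise(classes)
print(len(parts), len(exceptional))
```

Each original class contributes fewer than $\lfloor |E|/t^2 \rfloor$ leftover elements, so the exceptional class has size less than $|E|/t$, as claimed.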